-
Quantifying efficiency of remote excitation for surface enhanced Raman spectroscopy in molecular junctions
Authors:
Shusen Liao,
Yunxuan Zhu,
Qian Ye,
Stephen Sanders,
Jiawei Yang,
Alessandro Alabastri,
Douglas Natelson
Abstract:
Surface-enhanced Raman spectroscopy (SERS) is enabled by local surface plasmon resonances (LSPRs) in metallic nanogaps. When SERS is excited by direct illumination of the nanogap, the background heating of lattice and electrons can prevent further manipulation of the molecules. To overcome this issue, we report SERS in electromigrated gold molecular junctions excited remotely: surface plasmon pola…
▽ More
Surface-enhanced Raman spectroscopy (SERS) is enabled by local surface plasmon resonances (LSPRs) in metallic nanogaps. When SERS is excited by direct illumination of the nanogap, the background heating of lattice and electrons can prevent further manipulation of the molecules. To overcome this issue, we report SERS in electromigrated gold molecular junctions excited remotely: surface plasmon polaritons (SPPs) are excited at nearby gratings, propagate to the junction, and couple to the local nanogap plasmon modes. Like direct excitation, remote excitation of the nanogap can generate both SERS emission and an open-circuit photovoltage (OCPV). We compare SERS intensity and OCPV in both direct and remote illumination configurations. SERS spectra obtained by remote excitation are much more stable than those obtained through direct excitation when photon count rates are comparable. By statistical analysis of 33 devices, coupling efficiency of remote excitation is calculated to be around 10%, consistent with the simulated energy flow.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
FaceSkin: A Privacy Preserving Facial skin patch Dataset for multi Attributes classification
Authors:
Qiushi Guo,
Shisha Liao
Abstract:
Human facial skin images contain abundant textural information that can serve as valuable features for attribute classification, such as age, race, and gender. Additionally, facial skin images offer the advantages of easy collection and minimal privacy concerns. However, the availability of well-labeled human skin datasets with a sufficient number of images is limited. To address this issue, we in…
▽ More
Human facial skin images contain abundant textural information that can serve as valuable features for attribute classification, such as age, race, and gender. Additionally, facial skin images offer the advantages of easy collection and minimal privacy concerns. However, the availability of well-labeled human skin datasets with a sufficient number of images is limited. To address this issue, we introduce a dataset called FaceSkin, which encompasses a diverse range of ages and races. Furthermore, to broaden the application scenarios, we incorporate synthetic skin-patches obtained from 2D and 3D attack images, including printed paper, replays, and 3D masks. We evaluate the FaceSkin dataset across distinct categories and present experimental results demonstrating its effectiveness in attribute classification, as well as its potential for various downstream tasks, such as Face anti-spoofing and Age estimation.
△ Less
Submitted 9 August, 2023;
originally announced August 2023.
-
A novel approach for quantum financial simulation and quantum state preparation
Authors:
Yen-Jui Chang,
Wei-Ting Wang,
Hao-Yuan Chen,
Shih-Wei Liao,
Ching-Ray Chang
Abstract:
Quantum state preparation is vital in quantum computing and information processing. The ability to accurately and reliably prepare specific quantum states is essential for various applications. One of the promising applications of quantum computers is quantum simulation. This requires preparing a quantum state representing the system we are trying to simulate. This research introduces a novel simu…
▽ More
Quantum state preparation is vital in quantum computing and information processing. The ability to accurately and reliably prepare specific quantum states is essential for various applications. One of the promising applications of quantum computers is quantum simulation. This requires preparing a quantum state representing the system we are trying to simulate. This research introduces a novel simulation algorithm, the multi-Split-Steps Quantum Walk (multi-SSQW), designed to learn and load complicated probability distributions using parameterized quantum circuits (PQC) with a variational solver on classical simulators. The multi-SSQW algorithm is a modified version of the split-steps quantum walk, enhanced to incorporate a multi-agent decision-making process, rendering it suitable for modeling financial markets. The study provides theoretical descriptions and empirical investigations of the multi-SSQW algorithm to demonstrate its promising capabilities in probability distribution simulation and financial market modeling. Harnessing the advantages of quantum computation, the multi-SSQW models complex financial distributions and scenarios with high accuracy, providing valuable insights and mechanisms for financial analysis and decision-making. The multi-SSQW's key benefits include its modeling flexibility, stable convergence, and instantaneous computation. These advantages underscore its rapid modeling and prediction potential in dynamic financial markets.
△ Less
Submitted 20 April, 2024; v1 submitted 3 August, 2023;
originally announced August 2023.
-
The Timeless Timing Argument and the Mass of the Local Group
Authors:
Till Sawala,
Jorge Peñarrubia,
Shihong Liao,
Peter H. Johansson
Abstract:
The Timing Argument connects the motion of a two-body system to its mass in an expanding Universe with a finite age, under the assumption that it has evolved on a self-gravitating orbit. It is commonly applied to the present-day Milky Way-M31 system in order to infer its unknown mass from the measured kinematics. We use a set of Local Group analogues from the Uchuu simulation to investigate the Ti…
▽ More
The Timing Argument connects the motion of a two-body system to its mass in an expanding Universe with a finite age, under the assumption that it has evolved on a self-gravitating orbit. It is commonly applied to the present-day Milky Way-M31 system in order to infer its unknown mass from the measured kinematics. We use a set of Local Group analogues from the Uchuu simulation to investigate the Timing Argument over cosmic time. We find that the median inferred mass remains almost constant over the past 12 Gyr, even while the haloes themselves grew in mass by more than an order of magnitude. By contrast, we find a closer, and nearly time-invariant agreement between the Timing Argument value and the mass within a sphere of radius equal to the MW-M31 separation, and we identify this as the total mass of the system. We conclude that the comparatively close present-day agreement between the Timing Argument and the sum of the halo masses reflects no underlying relation, but merely echoes the fact that the MW and M31 now contain most (but not all) of the mass of the Local Group system.
△ Less
Submitted 18 August, 2023; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Astrometric mass measurement of compact companions in binary systems with Gaia
Authors:
Yilun Wang,
Shilong Liao,
Nicola Giacobbo,
Aleksandra Olejak,
Jian Gao,
Jifeng Liu
Abstract:
For binary systems with an unseen primary and a luminous secondary, the astrometric wobble of the secondary could be used to study the primary. With Gaia, it is possible to measure the mass of the black hole or neutron star with a luminous companion (hereafter BH/NS-LC). Our aim is to provide a method for predicting Gaia's ability in measuring the mass of BH/NS-LCs. We also tried to estimate the n…
▽ More
For binary systems with an unseen primary and a luminous secondary, the astrometric wobble of the secondary could be used to study the primary. With Gaia, it is possible to measure the mass of the black hole or neutron star with a luminous companion (hereafter BH/NS-LC). Our aim is to provide a method for predicting Gaia's ability in measuring the mass of BH/NS-LCs. We also tried to estimate the number of solvable BH/NS-LCs using Gaia. We used a realistic Markov chain Monte Carlo simulation of mock Gaia observations to obtain a relation between the uncertainty of mass measurement of the primary in BH/NS-LCs with the observable variables of the secondary astrometric orbit. Furthermore, we used the MOBSE code to evolve a Galactic BH/NS-LC sample with a combined Milky Way model. Our relation is applied to this sample to estimate the number of solvable BH/NS-LCs. We derived a good relation between the mass uncertainty and the binary parameters. For the first time, we show the quantitive influence of the period P, inclination i, eccentricity e, and ecliptic latitude $β$ to the mass measurement. Our results suggest that $48^{+7}_{-7}$ BH-LCs and $102^{+11}_{10}$ NS-LCs are solvable during a 5 yr Gaia mission. We also give the distribution of the distance and apparent magnitude of the Gaia solvable BH/NS-LCs. This solvable sample would be increased by additional spectroscopic data or a prolonged Gaia mission. The mass uncertainty relation could be used in future simulations of BH/NS-LCs observed by Gaia. The prediction of the solvable BH/NS-LCs is not only influenced by the process in generating the Galactic BH/NS-LC sample, but is also affected by our uncertainty relation. In particular, the relations of parameters such as $[P, e, i, β]$ are very useful to correct the selection effect in the statistic results of the future BH/NS-LC sample observed by Gaia.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
High-rate quantum key distribution exceeding 110 Mb/s
Authors:
Wei Li,
Likang Zhang,
Hao Tan,
Yichen Lu,
Sheng-Kai Liao,
Jia Huang,
Hao Li,
Zhen Wang,
Hao-Kun Mao,
Bingze Yan,
Qiong Li,
Yang Liu,
Qiang Zhang,
Cheng-Zhi Peng,
Lixing You,
Feihu Xu,
Jian-Wei Pan
Abstract:
Quantum key distribution (QKD) can provide fundamentally proven security for secure communication. Toward application, the secret key rate (SKR) is a key figure of merit for any QKD system. So far, the SKR has been limited to about a few megabit-per-second. Here we report a QKD system that is able to generate key at a record high SKR of 115.8 Mb/s over 10-km standard fibre, and to distribute key o…
▽ More
Quantum key distribution (QKD) can provide fundamentally proven security for secure communication. Toward application, the secret key rate (SKR) is a key figure of merit for any QKD system. So far, the SKR has been limited to about a few megabit-per-second. Here we report a QKD system that is able to generate key at a record high SKR of 115.8 Mb/s over 10-km standard fibre, and to distribute key over up to 328 km of ultra-low-loss fibre. This attributes to a multi-pixel superconducting nanowire single-photon detector with ultrahigh counting rate, an integrated transmitter that can stably encode polarization states with low error, a fast post-processing algorithm for generating key in real time and the high system clock-rate operation. The results demonstrate the feasibility of practical high-rate QKD with photonic techniques, thus opening its possibility for widespread applications.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
ProtoDiff: Learning to Learn Prototypical Networks by Task-Guided Diffusion
Authors:
Yingjun Du,
Zehao Xiao,
Shengcai Liao,
Cees Snoek
Abstract:
Prototype-based meta-learning has emerged as a powerful technique for addressing few-shot learning challenges. However, estimating a deterministic prototype using a simple average function from a limited number of examples remains a fragile process. To overcome this limitation, we introduce ProtoDiff, a novel framework that leverages a task-guided diffusion model during the meta-training phase to…
▽ More
Prototype-based meta-learning has emerged as a powerful technique for addressing few-shot learning challenges. However, estimating a deterministic prototype using a simple average function from a limited number of examples remains a fragile process. To overcome this limitation, we introduce ProtoDiff, a novel framework that leverages a task-guided diffusion model during the meta-training phase to gradually generate prototypes, thereby providing efficient class representations. Specifically, a set of prototypes is optimized to achieve per-task prototype overfitting, enabling accurately obtaining the overfitted prototypes for individual tasks. Furthermore, we introduce a task-guided diffusion process within the prototype space, enabling the meta-learning of a generative process that transitions from a vanilla prototype to an overfitted prototype. ProtoDiff gradually generates task-specific prototypes from random noise during the meta-test stage, conditioned on the limited samples available for the new task. Furthermore, to expedite training and enhance ProtoDiff's performance, we propose the utilization of residual prototype learning, which leverages the sparsity of the residual prototype. We conduct thorough ablation studies to demonstrate its ability to accurately capture the underlying prototype distribution and enhance generalization. The new state-of-the-art performance on within-domain, cross-domain, and few-task few-shot classification further substantiates the benefit of ProtoDiff.
△ Less
Submitted 6 November, 2023; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Strange Quasar Candidates with Abnormal Astrometric Characteristics from Gaia EDR3 and SDSS (SQUAB-II): Optical Identifications
Authors:
Xiang Ji,
Zhen-Ya Zheng,
Qiqi Wu,
Ruqiu Lin,
P. T. Rahna,
Yingkang Zhang,
Shuairu Zhu,
Shilong Liao,
Zhaoxiang Qi,
Tao An
Abstract:
There are some strange quasars with multiple Gaia detections or observed with abnormal astrometric characteristics, such as with large proper motions or significant astrometric noises. Those strange quasars could be potential candidates of quasar-star pairs, dual quasars (DQs), or lensed quasars (LQs). Searching for both DQs and LQs is of great importance in many fields of astrophysics. Here in th…
▽ More
There are some strange quasars with multiple Gaia detections or observed with abnormal astrometric characteristics, such as with large proper motions or significant astrometric noises. Those strange quasars could be potential candidates of quasar-star pairs, dual quasars (DQs), or lensed quasars (LQs). Searching for both DQs and LQs is of great importance in many fields of astrophysics. Here in this work, we select 143 SDSS spectroscopically confirmed quasars that have multiple Gaia EDR3 detections within 1 arcsec of the SDSS quasar' position. We apply several optical identification methods to classify this sample. We firstly exclude 65 quasar-star pairs via their stellar features including their parallaxes and proper motions, stellar features in the SDSS spectra, or via the colour-colour diagram. Based on the spectral-fitting results, we find 2 DQ candidates, one of which presents a double-peaked [O III] emission line feature and the other shows a broad $H_β$ velocity offset ($\sim$ 870 $ km s^{-1} $) relative to the [O III] $λ$5007 line. Via the colour difference method, we further find 56 LQ candidates with similar colours in their multiple images. We also cross-match 143 objects with the HST archive and find 19 targets with archival HST images. Our classification results of those 19 targets are mainly consistent with previous works.
△ Less
Submitted 7 July, 2023; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Nearly Optimal Algorithms with Sublinear Computational Complexity for Online Kernel Regression
Authors:
Junfan Li,
Shizhong Liao
Abstract:
The trade-off between regret and computational cost is a fundamental problem for online kernel regression, and previous algorithms worked on the trade-off can not keep optimal regret bounds at a sublinear computational complexity. In this paper, we propose two new algorithms, AOGD-ALD and NONS-ALD, which can keep nearly optimal regret bounds at a sublinear computational complexity, and give suffic…
▽ More
The trade-off between regret and computational cost is a fundamental problem for online kernel regression, and previous algorithms worked on the trade-off can not keep optimal regret bounds at a sublinear computational complexity. In this paper, we propose two new algorithms, AOGD-ALD and NONS-ALD, which can keep nearly optimal regret bounds at a sublinear computational complexity, and give sufficient conditions under which our algorithms work. Both algorithms dynamically maintain a group of nearly orthogonal basis used to approximate the kernel mapping, and keep nearly optimal regret bounds by controlling the approximate error. The number of basis depends on the approximate error and the decay rate of eigenvalues of the kernel matrix. If the eigenvalues decay exponentially, then AOGD-ALD and NONS-ALD separately achieves a regret of $O(\sqrt{L(f)})$ and $O(\mathrm{d}_{\mathrm{eff}}(μ)\ln{T})$ at a computational complexity in $O(\ln^2{T})$. If the eigenvalues decay polynomially with degree $p\geq 1$, then our algorithms keep the same regret bounds at a computational complexity in $o(T)$ in the case of $p>4$ and $p\geq 10$, respectively. $L(f)$ is the cumulative losses of $f$ and $\mathrm{d}_{\mathrm{eff}}(μ)$ is the effective dimension of the problem. The two regret bounds are nearly optimal and are not comparable.
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
KETJU -- resolving small-scale supermassive black hole dynamics in GADGET-4
Authors:
Matias Mannerkoski,
Alexander Rawlings,
Peter H. Johansson,
Thorsten Naab,
Antti Rantala,
Volker Springel,
Dimitrios Irodotou,
Shihong Liao
Abstract:
We present the new public version of the KETJU supermassive black hole (SMBH) dynamics module, as implemented into GADGET-4. KETJU adds a small region around each SMBH where the dynamics of the SMBHs and stellar particles are integrated using an algorithmically regularised integrator instead of the leapfrog integrator with gravitational softening used by GADGET-4. This enables modelling SMBHs as p…
▽ More
We present the new public version of the KETJU supermassive black hole (SMBH) dynamics module, as implemented into GADGET-4. KETJU adds a small region around each SMBH where the dynamics of the SMBHs and stellar particles are integrated using an algorithmically regularised integrator instead of the leapfrog integrator with gravitational softening used by GADGET-4. This enables modelling SMBHs as point particles even during close interactions with stellar particles or other SMBHs, effectively removing the spatial resolution limitation caused by gravitational softening. KETJU also includes post-Newtonian corrections, which allows following the dynamics of SMBH binaries to sub-parsec scales and down to tens of Schwarzschild radii. Systems with multiple SMBHs are also supported, with the code also including the leading non-linear cross terms that appear in the post-Newtonian equations for such systems. We present tests of the code showing that it correctly captures, at sufficient mass resolution, the sinking driven by dynamical friction and binary hardening driven by stellar scattering. We also present an example application demonstrating how the code can be applied to study the dynamics of SMBHs in mergers of multiple galaxies and the effect they have on the properties of the surrounding galaxy. We expect that the presented KETJU SMBH dynamics module can also be straightforwardly incorporated into other codes similar to GADGET-4, which would allow coupling small-scale SMBH dynamics to the rich variety of galactic physics models that exist in the literature.
△ Less
Submitted 8 June, 2023;
originally announced June 2023.
-
Short rank-metric codes and scattered subspaces
Authors:
Stefano Lia,
Giovanni Longobardi,
Giuseppe Marino,
Rocco Trombetti
Abstract:
By exploiting the connection between scattered $\mathbb{F}_q$-subspaces of $\mathbb{F}_{q^m}^3$ and minimal non degenerate $3$-dimensional rank metric codes of $\mathbb{F}_{q^m}^{n}$, $n \geq m+2$, described in [2], we will exhibit a new class of codes with parameters $[m+2,3,m-2]_{q^m/q}$ for infinite values of $q$ and $m \geq 5$ odd. Moreover, by studying the geometric structures of these scatte…
▽ More
By exploiting the connection between scattered $\mathbb{F}_q$-subspaces of $\mathbb{F}_{q^m}^3$ and minimal non degenerate $3$-dimensional rank metric codes of $\mathbb{F}_{q^m}^{n}$, $n \geq m+2$, described in [2], we will exhibit a new class of codes with parameters $[m+2,3,m-2]_{q^m/q}$ for infinite values of $q$ and $m \geq 5$ odd. Moreover, by studying the geometric structures of these scattered subspaces, we determine the rank weight distribution of the associated codes.
△ Less
Submitted 10 February, 2024; v1 submitted 2 June, 2023;
originally announced June 2023.
-
RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment
Authors:
Zutao Jiang,
Guian Fang,
Jianhua Han,
Guansong Lu,
Hang Xu,
Shengcai Liao,
Xiaojun Chang,
Xiaodan Liang
Abstract:
Recent advances in text-to-image diffusion models have achieved remarkable success in generating high-quality, realistic images from textual descriptions. However, these approaches have faced challenges in precisely aligning the generated visual content with the textual concepts described in the prompts. In this paper, we propose a two-stage coarse-to-fine semantic re-alignment method, named Reali…
▽ More
Recent advances in text-to-image diffusion models have achieved remarkable success in generating high-quality, realistic images from textual descriptions. However, these approaches have faced challenges in precisely aligning the generated visual content with the textual concepts described in the prompts. In this paper, we propose a two-stage coarse-to-fine semantic re-alignment method, named RealignDiff, aimed at improving the alignment between text and images in text-to-image diffusion models. In the coarse semantic re-alignment phase, a novel caption reward, leveraging the BLIP-2 model, is proposed to evaluate the semantic discrepancy between the generated image caption and the given text prompt. Subsequently, the fine semantic re-alignment stage employs a local dense caption generation module and a re-weighting attention modulation module to refine the previously generated images from a local semantic view. Experimental results on the MS-COCO and ViLG-300 datasets demonstrate that the proposed two-stage coarse-to-fine semantic re-alignment method outperforms other baseline re-alignment techniques by a substantial margin in both visual quality and semantic similarity with the input prompt.
△ Less
Submitted 23 October, 2024; v1 submitted 31 May, 2023;
originally announced May 2023.
-
Engineering the directionality of hot carrier tunneling in plasmonic tunneling structures
Authors:
Mahdiyeh Abbasi,
Shusen Liao,
Yunxuan Zhu,
Douglas Natelson
Abstract:
Tunneling metal-insulator-metal (MIM) junctions can exhibit an open-circuit photovoltage (OCPV) response under illumination that may be useful for photodetection. One mechanism for photovoltage generation is hot carrier tunneling, in which photoexcited carriers generate a net photocurrent that must be balanced by a drift current in the open-circuit configuration. We present experiments in electrom…
▽ More
Tunneling metal-insulator-metal (MIM) junctions can exhibit an open-circuit photovoltage (OCPV) response under illumination that may be useful for photodetection. One mechanism for photovoltage generation is hot carrier tunneling, in which photoexcited carriers generate a net photocurrent that must be balanced by a drift current in the open-circuit configuration. We present experiments in electromigrated planar MIM structures, designed with asymmetric plasmonic properties using Au and Pt electrodes. Decay of optically excited local plasmonic modes preferentially creates hot carriers on the Au side of the junction, leading to a clear preferred directionality of the hot electron photocurrent and hence a preferred polarity of the resulting OCPV. In contrast, in an ensemble of symmetric devices constructed from only one Au, polarity of the OCPV has no preferred direction.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Large Language Models are Few-Shot Health Learners
Authors:
Xin Liu,
Daniel McDuff,
Geza Kovacs,
Isaac Galatzer-Levy,
Jacob Sunshine,
Jiening Zhan,
Ming-Zher Poh,
Shun Liao,
Paolo Di Achille,
Shwetak Patel
Abstract:
Large language models (LLMs) can capture rich representations of concepts that are useful for real-world tasks. However, language alone is limited. While existing LLMs excel at text-based inferences, health applications require that models be grounded in numerical data (e.g., vital signs, laboratory values in clinical domains; steps, movement in the wellness domain) that is not easily or readily e…
▽ More
Large language models (LLMs) can capture rich representations of concepts that are useful for real-world tasks. However, language alone is limited. While existing LLMs excel at text-based inferences, health applications require that models be grounded in numerical data (e.g., vital signs, laboratory values in clinical domains; steps, movement in the wellness domain) that is not easily or readily expressed as text in existing training corpus. We demonstrate that with only few-shot tuning, a large language model is capable of grounding various physiological and behavioral time-series data and making meaningful inferences on numerous health tasks for both clinical and wellness contexts. Using data from wearable and medical sensor recordings, we evaluate these capabilities on the tasks of cardiac signal analysis, physical activity recognition, metabolic calculation (e.g., calories burned), and estimation of stress reports and mental health screeners.
△ Less
Submitted 24 May, 2023;
originally announced May 2023.
-
Origin of the exotic electronic states in antiferromagnetic NdSb
Authors:
Peng Li,
Tongrui Li,
Sen Liao,
Zhipeng Cao,
Rui Xu,
Yuzhe Wang,
Jianghao Yao,
Shengtao Cui,
Zhe Sun,
Yilin Wang,
Xiangang Wan,
Juan Jiang,
Donglai Feng
Abstract:
Using angle resolved photoemission spectroscopy measurements and first principle calculations, we report that the possible unconventional 2q antiferromagnetic (AFM) order in NdSb can induce unusual modulation on its electronic structure. The obvious extra bands observed in the AFM phase of NdSb are well reproduced by theoretical calculations, in which the Fermi-arc-like structures and sharp extra…
▽ More
Using angle resolved photoemission spectroscopy measurements and first principle calculations, we report that the possible unconventional 2q antiferromagnetic (AFM) order in NdSb can induce unusual modulation on its electronic structure. The obvious extra bands observed in the AFM phase of NdSb are well reproduced by theoretical calculations, in which the Fermi-arc-like structures and sharp extra bands are originated from the in-gap surface states. However, they are demonstrated to be topological trivial. By tuning the chemical potential, the AFM phase of NdSb would go through a topological phase transition, realizing a magnetic topological insulator phase. Hence, our study sheds new light on the rare earth monopnictides for searching unusual AFM structure and the potential of intrinsic magnetic topological materials.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.
-
Biomimetic IGA neuron growth modeling with neurite morphometric features and CNN-based prediction
Authors:
Kuanren Qian,
Ashlee S. Liao,
Shixuan Gu,
Victoria A. Webster-Wood,
Yongjie Jessica Zhang
Abstract:
Neuron growth is a complex, multi-stage process that develops sophisticated morphologies and interwoven neurite networks. Recent advances have enabled us to examine the effects of neuron growth factors and seek causes for neurodegenerative diseases, such as Alzheimer's disease, Parkinson's disease, and amyotrophic lateral sclerosis. A computational tool that studies neuron growth could shed crucia…
▽ More
Neuron growth is a complex, multi-stage process that develops sophisticated morphologies and interwoven neurite networks. Recent advances have enabled us to examine the effects of neuron growth factors and seek causes for neurodegenerative diseases, such as Alzheimer's disease, Parkinson's disease, and amyotrophic lateral sclerosis. A computational tool that studies neuron growth could shed crucial insights into the effects of various factors and help find a neurodegeneration cure. However, there lacks a computational tool to accurately and realistically simulate neuron growth within reasonable time frames. Bio-phenomenon models ignore potential factors and cannot generate realistic results, and bio-physics models require computationally expensive high-order governing equations. This paper incorporates experimental neurite features into a phase field method-based neuron growth model using an isogeometric analysis collocation (IGA-C) approach. Based on a semi-automated quantitative analysis of neurite morphology, we obtain relative turning angle, average tortuosity, neurite endpoints, average segment length, and the total length of neurites. We use the total neurite length to determine the evolving days in vitro (DIV) and select corresponding neurite features to drive and constrain neuron growth. This approach archives biomimetic neuron growth patterns with automatic growth stage transitions by incorporating corresponding DIV neurite morphometric data based on the total neurite length of the evolving neurite morphology. Furthermore, we built a convolutional neural network (CNN) to significantly reduce computational costs for predicting neurite growth. With a customized convolutional autoencoder as the backbone, our CNN model can predict neurite patterns with a high prediction accuracy, 97.77%, while taking 7 orders of magnitude less computational times than our IGA-C solver.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
Deep-Q Learning with Hybrid Quantum Neural Network on Solving Maze Problems
Authors:
Hao-Yuan Chen,
Yen-Jui Chang,
Shih-Wei Liao,
Ching-Ray Chang
Abstract:
Quantum computing holds great potential for advancing the limitations of machine learning algorithms to handle higher dimensions of data and reduce overall training parameters in deep learning (DL) models. This study uses a trainable variational quantum circuit (VQC) on a gate-based quantum computing model to investigate the potential for quantum benefit in a model-free reinforcement learning prob…
▽ More
Quantum computing holds great potential for advancing the limitations of machine learning algorithms to handle higher dimensions of data and reduce overall training parameters in deep learning (DL) models. This study uses a trainable variational quantum circuit (VQC) on a gate-based quantum computing model to investigate the potential for quantum benefit in a model-free reinforcement learning problem. Through a comprehensive investigation and evaluation of the current model and capabilities of quantum computers, we designed and trained a novel hybrid quantum neural network based on the latest Qiskit and PyTorch framework. We compared its performance with a full-classical CNN with and without an incorporated VQC. Our research provides insights into the potential of deep quantum learning to solve a maze problem and, potentially, other reinforcement learning problems. We conclude that reinforcement learning problems can be practical with reasonable training epochs. Moreover, a comparative study of full-classical and hybrid quantum neural networks is discussed to understand these two approaches' performance, advantages, and disadvantages to deep-Q learning problems, especially on larger-scale maze problems larger than 4x4.
△ Less
Submitted 1 December, 2023; v1 submitted 20 April, 2023;
originally announced April 2023.
-
POCE: Pose-Controllable Expression Editing
Authors:
Rongliang Wu,
Yingchen Yu,
Fangneng Zhan,
Jiahui Zhang,
Shengcai Liao,
Shijian Lu
Abstract:
Facial expression editing has attracted increasing attention with the advance of deep neural networks in recent years. However, most existing methods suffer from compromised editing fidelity and limited usability as they either ignore pose variations (unrealistic editing) or require paired training data (not easy to collect) for pose controls. This paper presents POCE, an innovative pose-controlla…
▽ More
Facial expression editing has attracted increasing attention with the advance of deep neural networks in recent years. However, most existing methods suffer from compromised editing fidelity and limited usability as they either ignore pose variations (unrealistic editing) or require paired training data (not easy to collect) for pose controls. This paper presents POCE, an innovative pose-controllable expression editing network that can generate realistic facial expressions and head poses simultaneously with just unpaired training images. POCE achieves the more accessible and realistic pose-controllable expression editing by mapping face images into UV space, where facial expressions and head poses can be disentangled and edited separately. POCE has two novel designs. The first is self-supervised UV completion that allows to complete UV maps sampled under different head poses, which often suffer from self-occlusions and missing facial texture. The second is weakly-supervised UV editing that allows to generate new facial expressions with minimal modification of facial identity, where the synthesized expression could be controlled by either an expression label or directly transplanted from a reference UV map via feature transfer. Extensive experiments show that POCE can learn from unpaired face images effectively, and the learned model can generate realistic and high-fidelity facial expressions under various new poses.
△ Less
Submitted 18 April, 2023;
originally announced April 2023.
-
Simulation of CSSTs astrometric capability
Authors:
Zhensen Fu,
Zhaoxiang Qi,
Shilong Liao,
Xiyan Peng,
Yong Yu,
Qiqi Wu,
Li Shao,
Youhua Xu
Abstract:
The China Space Station Telescope (CSST) will enter a low Earth orbit around 2024 and operate for 10 years, with seven of those years devoted to surveying the area of the median-to-high Galactic latitude and median-to-high Ecliptic latitude of the sky. To maximize the scientific output of CSST, it is important to optimize the survey schedule. We aim to evaluate the astrometric capability of CSST f…
▽ More
The China Space Station Telescope (CSST) will enter a low Earth orbit around 2024 and operate for 10 years, with seven of those years devoted to surveying the area of the median-to-high Galactic latitude and median-to-high Ecliptic latitude of the sky. To maximize the scientific output of CSST, it is important to optimize the survey schedule. We aim to evaluate the astrometric capability of CSST for a given survey schedule and to provide independent suggestions for the optimization of the survey strategy. For this purpose, we first construct the astrometric model and then conduct simulated observations based on the given survey schedule. The astrometric solution is obtained by analyzing the simulated observation data. And then we evaluate the astrometric capability of CSST by analyzing the properties of the astrometric solution. We find that the accuracy of parallax and proper motion of CSST is better than 1 mas( yr1) for the sources of 18-22 mag in g band, and about 1-10 mas( yr1) for the sources of 22-26 mag in g band, respectively. The results from real survey could be worse since the assumptions are optimistic and simple. We find that optimizing the survey schedule can improve the astrometric accuracy of CSST. In the future, we will improve the astrometric capability of CSST by continuously iterating and optimizing the survey schedule.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Image Blind Denoising Using Dual Convolutional Neural Network with Skip Connection
Authors:
Wencong Wu,
Shicheng Liao,
Guannan Lv,
Peng Liang,
Yungang Zhang
Abstract:
In recent years, deep convolutional neural networks have shown fascinating performance in the field of image denoising. However, deeper network architectures are often accompanied with large numbers of model parameters, leading to high training cost and long inference time, which limits their application in practical denoising tasks. In this paper, we propose a novel dual convolutional blind denoi…
▽ More
In recent years, deep convolutional neural networks have shown fascinating performance in the field of image denoising. However, deeper network architectures are often accompanied with large numbers of model parameters, leading to high training cost and long inference time, which limits their application in practical denoising tasks. In this paper, we propose a novel dual convolutional blind denoising network with skip connection (DCBDNet), which is able to achieve a desirable balance between the denoising effect and network complexity. The proposed DCBDNet consists of a noise estimation network and a dual convolutional neural network (CNN). The noise estimation network is used to estimate the noise level map, which improves the flexibility of the proposed model. The dual CNN contains two branches: a u-shaped sub-network is designed for the upper branch, and the lower branch is composed of the dilated convolution layers. Skip connections between layers are utilized in both the upper and lower branches. The proposed DCBDNet was evaluated on several synthetic and real-world image denoising benchmark datasets. Experimental results have demonstrated that the proposed DCBDNet can effectively remove gaussian noise in a wide range of levels, spatially variant noise and real noise. With a simple model structure, our proposed DCBDNet still can obtain competitive denoising performance compared to the state-of-the-art image denoising models containing complex architectures. Namely, a favorable trade-off between denoising performance and model complexity is achieved. Codes are available at https://github.com/WenCongWu/DCBDNet.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
On the stability and instability of Kelvin-Stuart cat's eyes flows
Authors:
Shasha Liao,
Zhiwu Lin,
Hao Zhu
Abstract:
Kelvin-Stuart vortices are classical mixing layer flows with many applications in fluid mechanics, plasma physics and astrophysics. We prove that the whole family of Kelvin-Stuart vortices is nonlinearly stable for co-periodic perturbations, and linearly unstable for multi-periodic or modulational perturbations. This verifies a long-standing conjecture since the discovery of the Kelvin-Stuart cat'…
▽ More
Kelvin-Stuart vortices are classical mixing layer flows with many applications in fluid mechanics, plasma physics and astrophysics. We prove that the whole family of Kelvin-Stuart vortices is nonlinearly stable for co-periodic perturbations, and linearly unstable for multi-periodic or modulational perturbations. This verifies a long-standing conjecture since the discovery of the Kelvin-Stuart cat's eyes flows in the 1960s. Kelvin-Stuart cat's eyes also appear as magnetic islands which are magnetostatic equilibria for the 2D ideal MHD equations in plasmas. We prove nonlinear stability of Kelvin-Stuart magnetic islands for co-periodic perturbations, and give the first rigorous proof of the coalescence instability, which is important for magnetic reconnection.
△ Less
Submitted 30 December, 2023; v1 submitted 1 April, 2023;
originally announced April 2023.
-
KD-DLGAN: Data Limited Image Generation via Knowledge Distillation
Authors:
Kaiwen Cui,
Yingchen Yu,
Fangneng Zhan,
Shengcai Liao,
Shijian Lu1,
Eric Xing
Abstract:
Generative Adversarial Networks (GANs) rely heavily on large-scale training data for training high-quality image generation models. With limited training data, the GAN discriminator often suffers from severe overfitting which directly leads to degraded generation especially in generation diversity. Inspired by the recent advances in knowledge distillation (KD), we propose KD-DLGAN, a knowledge-dis…
▽ More
Generative Adversarial Networks (GANs) rely heavily on large-scale training data for training high-quality image generation models. With limited training data, the GAN discriminator often suffers from severe overfitting which directly leads to degraded generation especially in generation diversity. Inspired by the recent advances in knowledge distillation (KD), we propose KD-DLGAN, a knowledge-distillation based generation framework that introduces pre-trained vision-language models for training effective data-limited generation models. KD-DLGAN consists of two innovative designs. The first is aggregated generative KD that mitigates the discriminator overfitting by challenging the discriminator with harder learning tasks and distilling more generalizable knowledge from the pre-trained models. The second is correlated generative KD that improves the generation diversity by distilling and preserving the diverse image-text correlation within the pre-trained models. Extensive experiments over multiple benchmarks show that KD-DLGAN achieves superior image generation with limited training data. In addition, KD-DLGAN complements the state-of-the-art with consistent and substantial performance gains.
△ Less
Submitted 30 March, 2023;
originally announced March 2023.
-
Effect of light injection on the security of practical quantum key distribution
Authors:
Liying Han,
Yang Li,
Hao Tan,
Weiyang Zhang,
Wenqi Cai,
Juan Yin,
Jigang Ren,
Feihu Xu,
Shengkai Liao,
Chengzhi Peng
Abstract:
Quantum key distribution (QKD) based on the fundamental laws of quantum physics can allow the distribution of secure keys between distant users. However, the imperfections in realistic devices may lead to potential security risks, which must be accurately characterized and considered in practical security analysis. High-speed optical modulators, being as one of the core components of practical QKD…
▽ More
Quantum key distribution (QKD) based on the fundamental laws of quantum physics can allow the distribution of secure keys between distant users. However, the imperfections in realistic devices may lead to potential security risks, which must be accurately characterized and considered in practical security analysis. High-speed optical modulators, being as one of the core components of practical QKD systems, can be used to prepare the required quantum states. Here, we find that optical modulators based on LiNbO3, including phase modulators and intensity modulators, are vulnerable to photorefractive effect caused by external light injection. By changing the power of external light, eavesdroppers can control the intensities of the prepared states, posing a potential threat to the security of QKD. We have experimentally demonstrated the influence of light injection on LiNbO3-based optical modulators and analyzed the security risks caused by the potential green light injection attack, along with the corresponding countermeasures.
△ Less
Submitted 2 January, 2024; v1 submitted 26 March, 2023;
originally announced March 2023.
-
Response to "Comment on 'Faraday waves in a Hele-Shaw cell' Phys Fluids 30,042106 (2018)"
Authors:
Jing Li,
Xiaochen Li,
Shijun Liao
Abstract:
This is a response to the comment cited as PoF2023,35:029101. We show that the depth will have an impact on the wave height and the scaling law we obtained before is more feasible to use as a prior to give an initial guess of the wave height without any experimental information. This response strongly supports our previous work and the capability of the scaling law.
This is a response to the comment cited as PoF2023,35:029101. We show that the depth will have an impact on the wave height and the scaling law we obtained before is more feasible to use as a prior to give an initial guess of the wave height without any experimental information. This response strongly supports our previous work and the capability of the scaling law.
△ Less
Submitted 22 March, 2023;
originally announced March 2023.
-
Region-wise matching for image inpainting based on adaptive weighted low-rank decomposition
Authors:
Shenghai Liao,
Xuya Liu,
Ruyi Han,
Shujun Fu,
Yuanfeng Zhou,
Yuliang Li
Abstract:
Digital image inpainting is an interpolation problem, inferring the content in the missing (unknown) region to agree with the known region data such that the interpolated result fulfills some prior knowledge. Low-rank and nonlocal self-similarity are two important priors for image inpainting. Based on the nonlocal self-similarity assumption, an image is divided into overlapped square target patche…
▽ More
Digital image inpainting is an interpolation problem, inferring the content in the missing (unknown) region to agree with the known region data such that the interpolated result fulfills some prior knowledge. Low-rank and nonlocal self-similarity are two important priors for image inpainting. Based on the nonlocal self-similarity assumption, an image is divided into overlapped square target patches (submatrices) and the similar patches of any target patch are reshaped as vectors and stacked into a patch matrix. Such a patch matrix usually enjoys a property of low rank or approximately low rank, and its missing entries are recoveried by low-rank matrix approximation (LRMA) algorithms. Traditionally, $n$ nearest neighbor similar patches are searched within a local window centered at a target patch. However, for an image with missing lines, the generated patch matrix is prone to having entirely-missing rows such that the downstream low-rank model fails to reconstruct it well. To address this problem, we propose a region-wise matching (RwM) algorithm by dividing the neighborhood of a target patch into multiple subregions and then search the most similar one within each subregion. A non-convex weighted low-rank decomposition (NC-WLRD) model for LRMA is also proposed to reconstruct all degraded patch matrices grouped by the proposed RwM algorithm. We solve the proposed NC-WLRD model by the alternating direction method of multipliers (ADMM) and analyze the convergence in detail. Numerous experiments on line inpainting (entire-row/column missing) demonstrate the superiority of our method over other competitive inpainting algorithms. Unlike other low-rank-based matrix completion methods and inpainting algorithms, the proposed model NC-WLRD is also effective for removing random-valued impulse noise and structural noise (stripes).
△ Less
Submitted 22 March, 2023;
originally announced March 2023.
-
Self-Paced Learning for Open-Set Domain Adaptation
Authors:
Xinghong Liu,
Yi Zhou,
Tao Zhou,
Jie Qin,
Shengcai Liao
Abstract:
Domain adaptation tackles the challenge of generalizing knowledge acquired from a source domain to a target domain with different data distributions. Traditional domain adaptation methods presume that the classes in the source and target domains are identical, which is not always the case in real-world scenarios. Open-set domain adaptation (OSDA) addresses this limitation by allowing previously un…
▽ More
Domain adaptation tackles the challenge of generalizing knowledge acquired from a source domain to a target domain with different data distributions. Traditional domain adaptation methods presume that the classes in the source and target domains are identical, which is not always the case in real-world scenarios. Open-set domain adaptation (OSDA) addresses this limitation by allowing previously unseen classes in the target domain. Open-set domain adaptation aims to not only recognize target samples belonging to common classes shared by source and target domains but also perceive unknown class samples. We propose a novel framework based on self-paced learning to distinguish common and unknown class samples precisely, referred to as SPLOS (self-paced learning for open-set). To utilize unlabeled target samples for self-paced learning, we generate pseudo labels and design a cross-domain mixup method tailored for OSDA scenarios. This strategy minimizes the noise from pseudo labels and ensures our model progressively learns common class features of the target domain, beginning with simpler examples and advancing to more complex ones. Furthermore, unlike existing OSDA methods that require manual hyperparameter $threshold$ tuning to separate common and unknown classes, our approach self-tunes a suitable threshold, eliminating the need for empirical tuning during testing. Comprehensive experiments illustrate that our method consistently achieves superior performance on different benchmarks compared with various state-of-the-art methods.
△ Less
Submitted 21 March, 2023; v1 submitted 10 March, 2023;
originally announced March 2023.
-
Improved Regret Bounds for Online Kernel Selection under Bandit Feedback
Authors:
Junfan Li,
Shizhong Liao
Abstract:
In this paper, we improve the regret bound for online kernel selection under bandit feedback. Previous algorithm enjoys a $O((\Vert f\Vert^2_{\mathcal{H}_i}+1)K^{\frac{1}{3}}T^{\frac{2}{3}})$ expected bound for Lipschitz loss functions. We prove two types of regret bounds improving the previous bound. For smooth loss functions, we propose an algorithm with a…
▽ More
In this paper, we improve the regret bound for online kernel selection under bandit feedback. Previous algorithm enjoys a $O((\Vert f\Vert^2_{\mathcal{H}_i}+1)K^{\frac{1}{3}}T^{\frac{2}{3}})$ expected bound for Lipschitz loss functions. We prove two types of regret bounds improving the previous bound. For smooth loss functions, we propose an algorithm with a $O(U^{\frac{2}{3}}K^{-\frac{1}{3}}(\sum^K_{i=1}L_T(f^\ast_i))^{\frac{2}{3}})$ expected bound where $L_T(f^\ast_i)$ is the cumulative losses of optimal hypothesis in $\mathbb{H}_{i}=\{f\in\mathcal{H}_i:\Vert f\Vert_{\mathcal{H}_i}\leq U\}$. The data-dependent bound keeps the previous worst-case bound and is smaller if most of candidate kernels match well with the data. For Lipschitz loss functions, we propose an algorithm with a $O(U\sqrt{KT}\ln^{\frac{2}{3}}{T})$ expected bound asymptotically improving the previous bound. We apply the two algorithms to online kernel selection with time constraint and prove new regret bounds matching or improving the previous $O(\sqrt{T\ln{K}} +\Vert f\Vert^2_{\mathcal{H}_i}\max\{\sqrt{T},\frac{T}{\sqrt{\mathcal{R}}}\})$ expected bound where $\mathcal{R}$ is the time budget. Finally, we empirically verify our algorithms on online regression and classification tasks.
△ Less
Submitted 23 March, 2023; v1 submitted 8 March, 2023;
originally announced March 2023.
-
On the geometry of the Hermitian Veronese curve and its quasi-Hermitian surfaces
Authors:
Michel Lavrauw,
Stefano Lia,
Francesco Pavese
Abstract:
The complete classification of the orbits on subspaces under the action of the projective stabiliser of (classical) algebraic varieties is a challenging task, and few classifications are complete. We focus on a particular action of $\PGL(2,q^2)$ (and $\PSL(2,q^2)$) arising from the Hermitian Veronese curve in $\PG(3, q^2)$, a maximal rational curve embedded on a smooth Hermitian surface with some…
▽ More
The complete classification of the orbits on subspaces under the action of the projective stabiliser of (classical) algebraic varieties is a challenging task, and few classifications are complete. We focus on a particular action of $\PGL(2,q^2)$ (and $\PSL(2,q^2)$) arising from the Hermitian Veronese curve in $\PG(3, q^2)$, a maximal rational curve embedded on a smooth Hermitian surface with some fascinating properties. The study of its orbits leads to a new construction of quasi-Hermitian surfaces: sets of points with the same combinatorial and geometric properties as a non-degenerate Hermitian surface.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
Preparing random state for quantum financing with quantum walks
Authors:
Yen-Jui Chang,
Wei-Ting Wang,
Hao-Yuan Chen,
Shih-Wei Liao,
Ching-Ray Chang
Abstract:
In recent years, there has been an emerging trend of combining two innovations in computer science and physics to achieve better computation capability. Exploring the potential of quantum computation to achieve highly efficient performance in various tasks is a vital development in engineering and a valuable question in sciences, as it has a significant potential to provide exponential speedups fo…
▽ More
In recent years, there has been an emerging trend of combining two innovations in computer science and physics to achieve better computation capability. Exploring the potential of quantum computation to achieve highly efficient performance in various tasks is a vital development in engineering and a valuable question in sciences, as it has a significant potential to provide exponential speedups for technologically complex problems that are specifically advantageous to quantum computers. However, one key issue in unleashing this potential is constructing an efficient approach to load classical data into quantum states that can be executed by quantum computers or quantum simulators on classical hardware. Therefore, the split-step quantum walks (SSQW) algorithm was proposed to address this limitation. We facilitate SSQW to design parameterized quantum circuits (PQC) that can generate probability distributions and optimize the parameters to achieve the desired distribution using a variational solver. A practical example of implementing SSQW using Qiskit has been released as open-source software. Showing its potential as a promising method for generating desired probability amplitude distributions highlights the potential application of SSQW in option pricing through quantum simulation.
△ Less
Submitted 10 March, 2023; v1 submitted 24 February, 2023;
originally announced February 2023.
-
Energy-Based Test Sample Adaptation for Domain Generalization
Authors:
Zehao Xiao,
Xiantong Zhen,
Shengcai Liao,
Cees G. M. Snoek
Abstract:
In this paper, we propose energy-based sample adaptation at test time for domain generalization. Where previous works adapt their models to target domains, we adapt the unseen target samples to source-trained models. To this end, we design a discriminative energy-based model, which is trained on source domains to jointly model the conditional distribution for classification and data distribution f…
▽ More
In this paper, we propose energy-based sample adaptation at test time for domain generalization. Where previous works adapt their models to target domains, we adapt the unseen target samples to source-trained models. To this end, we design a discriminative energy-based model, which is trained on source domains to jointly model the conditional distribution for classification and data distribution for sample adaptation. The model is optimized to simultaneously learn a classifier and an energy function. To adapt target samples to source distributions, we iteratively update the samples by energy minimization with stochastic gradient Langevin dynamics. Moreover, to preserve the categorical information in the sample during adaptation, we introduce a categorical latent variable into the energy-based model. The latent variable is learned from the original sample before adaptation by variational inference and fixed as a condition to guide the sample update. Experiments on six benchmarks for classification of images and microblog threads demonstrate the effectiveness of our proposal.
△ Less
Submitted 22 February, 2023;
originally announced February 2023.
-
A Self-Adaptive Algorithm of the Clean Numerical Simulation (CNS) for Chaos
Authors:
Shijie Qin,
Shijun Liao
Abstract:
The background numerical noise $\varepsilon_{0} $ is determined by the maximum of truncation error and round-off error. For a chaotic system, the numerical error $\varepsilon(t)$ grows exponentially, say, $\varepsilon(t) = \varepsilon_{0} \exp(κ\,t)$, where $κ>0$ is the so-called noise-growing exponent. This is the reason why one can not gain a convergent simulation of chaotic systems in a long en…
▽ More
The background numerical noise $\varepsilon_{0} $ is determined by the maximum of truncation error and round-off error. For a chaotic system, the numerical error $\varepsilon(t)$ grows exponentially, say, $\varepsilon(t) = \varepsilon_{0} \exp(κ\,t)$, where $κ>0$ is the so-called noise-growing exponent. This is the reason why one can not gain a convergent simulation of chaotic systems in a long enough interval of time by means of traditional algorithms in double precision, since the background numerical noise $\varepsilon_{0}$ might stop decreasing because of the use of double precision. This restriction can be overcome by means of the clean numerical simulation (CNS), which can decrease the background numerical noise $\varepsilon_{0}$ to any required tiny level. A lot of successful applications show the novelty and validity of the CNS. In this paper, we further propose some strategies to greatly increase the computational efficiency of the CNS algorithms for chaotic dynamical systems. It is highly suggested to keep a balance between truncation error and round-off error and besides to progressively enlarge the background numerical noise $\varepsilon_{0}$, since the exponentially increasing numerical noise $\varepsilon(t)$ is much larger than it. Some examples are given to illustrate the validity of our strategies for the CNS.
△ Less
Submitted 31 January, 2023;
originally announced February 2023.
-
Improved Kernel Alignment Regret Bound for Online Kernel Learning
Authors:
Junfan Li,
Shizhong Liao
Abstract:
In this paper, we improve the kernel alignment regret bound for online kernel learning in the regime of the Hinge loss function. Previous algorithm achieves a regret of $O((\mathcal{A}_TT\ln{T})^{\frac{1}{4}})$ at a computational complexity (space and per-round time) of $O(\sqrt{\mathcal{A}_TT\ln{T}})$, where $\mathcal{A}_T$ is called \textit{kernel alignment}. We propose an algorithm whose regret…
▽ More
In this paper, we improve the kernel alignment regret bound for online kernel learning in the regime of the Hinge loss function. Previous algorithm achieves a regret of $O((\mathcal{A}_TT\ln{T})^{\frac{1}{4}})$ at a computational complexity (space and per-round time) of $O(\sqrt{\mathcal{A}_TT\ln{T}})$, where $\mathcal{A}_T$ is called \textit{kernel alignment}. We propose an algorithm whose regret bound and computational complexity are better than previous results. Our results depend on the decay rate of eigenvalues of the kernel matrix. If the eigenvalues of the kernel matrix decay exponentially, then our algorithm enjoys a regret of $O(\sqrt{\mathcal{A}_T})$ at a computational complexity of $O(\ln^2{T})$. Otherwise, our algorithm enjoys a regret of $O((\mathcal{A}_TT)^{\frac{1}{4}})$ at a computational complexity of $O(\sqrt{\mathcal{A}_TT})$. We extend our algorithm to batch learning and obtain a $O(\frac{1}{T}\sqrt{\mathbb{E}[\mathcal{A}_T]})$ excess risk bound which improves the previous $O(1/\sqrt{T})$ bound.
△ Less
Submitted 13 March, 2024; v1 submitted 25 December, 2022;
originally announced December 2022.
-
Constraining interacting dark energy models with the halo concentration - mass relation
Authors:
Yu Zhao,
Yun Liu,
Shihong Liao,
Jiajun Zhang,
Xiangkun Liu,
Wei Du
Abstract:
The interacting dark energy (IDE) model is a promising alternative cosmological model which has the potential to solve the fine-tuning and coincidence problems by considering the interaction between dark matter and dark energy. Previous studies have shown that the energy exchange between the dark sectors in this model can significantly affect the dark matter halo properties. In this study, utilisi…
▽ More
The interacting dark energy (IDE) model is a promising alternative cosmological model which has the potential to solve the fine-tuning and coincidence problems by considering the interaction between dark matter and dark energy. Previous studies have shown that the energy exchange between the dark sectors in this model can significantly affect the dark matter halo properties. In this study, utilising a large set of cosmological $N$-body simulations, we analyse the redshift evolution of the halo concentration - mass ($c$ - $M$) relation in the IDE model, and show that the $c$ - $M$ relation is a sensitive proxy of the interaction strength parameter $ξ_2$, especially at lower redshifts. Furthermore, we construct parametrized formulae to quantify the dependence of the $c$ - $M$ relation on $ξ_2$ at redshifts ranging from $z=0$ to $0.6$. Our parametrized formulae provide a useful tool in constraining $ξ_2$ with the observational $c$ - $M$ relation. As a first attempt, we use the data from X-ray, gravitational lensing, and galaxy rotational curve observations and obtain a tight constraint on $ξ_2$, i.e. $ξ_2 = 0.071 \pm 0.034$. Our work demonstrates that the halo $c$ - $M$ relation, which reflects the halo assembly history, is a powerful probe to constrain the IDE model.
△ Less
Submitted 5 December, 2022;
originally announced December 2022.
-
JAX-FEM: A differentiable GPU-accelerated 3D finite element solver for automatic inverse design and mechanistic data science
Authors:
Tianju Xue,
Shuheng Liao,
Zhengtao Gan,
Chanwook Park,
Xiaoyu Xie,
Wing Kam Liu,
Jian Cao
Abstract:
This paper introduces JAX-FEM, an open-source differentiable finite element method (FEM) library. Constructed on top of Google JAX, a rising machine learning library focusing on high-performance numerical computing, JAX-FEM is implemented with pure Python while scalable to efficiently solve problems with moderate to large sizes. For example, in a 3D tensile loading problem with 7.7 million degrees…
▽ More
This paper introduces JAX-FEM, an open-source differentiable finite element method (FEM) library. Constructed on top of Google JAX, a rising machine learning library focusing on high-performance numerical computing, JAX-FEM is implemented with pure Python while scalable to efficiently solve problems with moderate to large sizes. For example, in a 3D tensile loading problem with 7.7 million degrees of freedom, JAX-FEM with GPU achieves around 10$\times$ acceleration compared to a commercial FEM code depending on platform. Beyond efficiently solving forward problems, JAX-FEM employs the automatic differentiation technique so that inverse problems are solved in a fully automatic manner without the need to manually derive sensitivities. Examples of 3D topology optimization of nonlinear materials are shown to achieve optimal compliance. Finally, JAX-FEM is an integrated platform for machine learning-aided computational mechanics. We show an example of data-driven multi-scale computations of a composite material where JAX-FEM provides an all-in-one solution from microscopic data generation and model training to macroscopic FE computations. The source code of the library and these examples are shared with the community to facilitate computational mechanics research.
△ Less
Submitted 1 December, 2022;
originally announced December 2022.
-
Unraveling heterogeneity of ADNI's time-to-event data using conditional entropy Part-I: Cross-sectional study
Authors:
Shuting Liao,
Fushing Hsieh
Abstract:
Through Alzheimer's Disease Neuroimaging Initiative (ADNI), time-to-event data: from the pre-dementia state of mild cognitive impairment (MCI) to the diagnosis of Alzheimer's disease (AD), is collected and analyzed by explicitly unraveling prognostic heterogeneity among 346 uncensored and 557 right censored subjects under structural dependency among covariate features. The non-informative censorin…
▽ More
Through Alzheimer's Disease Neuroimaging Initiative (ADNI), time-to-event data: from the pre-dementia state of mild cognitive impairment (MCI) to the diagnosis of Alzheimer's disease (AD), is collected and analyzed by explicitly unraveling prognostic heterogeneity among 346 uncensored and 557 right censored subjects under structural dependency among covariate features. The non-informative censoring mechanism is tested and confirmed based on conditional-vs-marginal entropies evaluated upon contingency tables built by the Redistribute-to-the-right algorithm. The Categorical Exploratory Data Analysis (CEDA) paradigm is applied to evaluate conditional entropy-based associative patterns between the categorized response variable against 16 categorized covariable variables all having 4 categories. Two order-1 global major factors: V9 (MEM-mean) and V8 (ADAS13.bl) are selected sharing the highest amounts of mutual information with the response variable. This heavily censored data set is analyzed by Cox's proportional hazard (PH) modeling. Comparisons of PH and CEDA results on a global scale are complicated under the structural dependency of covariate features. To alleviate such complications, V9 and V8 are taken as two potential perspectives of heterogeneity and the entire collections of subjects are divided into two sets of four sub-collections. CEDA major factor selection protocol is applied to all sub-collections to figure out which features provide extra information. Graphic displays are developed to explicitly unravel conditional entropy expansions upon perspectives of heterogeneity in ADNI data. On the local scale, PH analysis is carried out and results are compared with CEDA's. We conclude that, when facing structural dependency among covariates and heterogeneity in data, CEDA and its major factor selection provide significant merits for manifesting data's multiscale information content.
△ Less
Submitted 28 November, 2022;
originally announced November 2022.
-
Tapping the Potential of Coherence and Syntactic Features in Neural Models for Automatic Essay Scoring
Authors:
Xinying Qiu,
Shuxuan Liao,
Jiajun Xie,
Jian-Yun Nie
Abstract:
In the prompt-specific holistic score prediction task for Automatic Essay Scoring, the general approaches include pre-trained neural model, coherence model, and hybrid model that incorporate syntactic features with neural model. In this paper, we propose a novel approach to extract and represent essay coherence features with prompt-learning NSP that shows to match the state-of-the-art AES coherenc…
▽ More
In the prompt-specific holistic score prediction task for Automatic Essay Scoring, the general approaches include pre-trained neural model, coherence model, and hybrid model that incorporate syntactic features with neural model. In this paper, we propose a novel approach to extract and represent essay coherence features with prompt-learning NSP that shows to match the state-of-the-art AES coherence model, and achieves the best performance for long essays. We apply syntactic feature dense embedding to augment BERT-based model and achieve the best performance for hybrid methodology for AES. In addition, we explore various ideas to combine coherence, syntactic information and semantic embeddings, which no previous study has done before. Our combined model also performs better than the SOTA available for combined model, even though it does not outperform our syntactic enhanced neural model. We further offer analyses that can be useful for future study.
△ Less
Submitted 23 November, 2022;
originally announced November 2022.
-
The growth of intermediate mass black holes through tidal captures and tidal disruption events
Authors:
Francesco Paolo Rizzuto,
Thorsten Naab,
Antti Rantala,
Peter H. Johansson,
Jeremiah P. Ostriker,
Nicholas C. Stone,
Shihong Liao,
Dimitrios Irodotou
Abstract:
We present $N\mathrm{-body} $ simulations, including post-Newtonian dynamics, of dense clusters of low-mass stars harbouring central black holes (BHs) with initial masses of 50, 300, and 2000 $\mathrm{M_{\odot}}$. The models are evolved with the $N\mathrm{-body} $ code \textsc{bifrost} to investigate the possible formation and growth of massive BHs by the tidal capture of stars and tidal disruptio…
▽ More
We present $N\mathrm{-body} $ simulations, including post-Newtonian dynamics, of dense clusters of low-mass stars harbouring central black holes (BHs) with initial masses of 50, 300, and 2000 $\mathrm{M_{\odot}}$. The models are evolved with the $N\mathrm{-body} $ code \textsc{bifrost} to investigate the possible formation and growth of massive BHs by the tidal capture of stars and tidal disruption events (TDEs). We model star-BH tidal interactions using a velocity-dependent drag force, which causes orbital energy and angular momentum loss near the BH. About $\sim 20-30$ per cent of the stars within the spheres of influence of the black holes form Bahcall-Wolf cusps and prevent the systems from core collapse. Within the first 40 Myr of evolution, the systems experience 500 up to 1300 TDEs, depending on the initial cluster structure. Most ($> 95$ per cent) of the TDEs originate from stars in the Bahcall-Wolf cusp. We derive an analytical formula for the TDE rate as a function of the central BH mass, density and velocity dispersion of the clusters ($\dot{N}_{\mathrm{TDE}} \propto M\mathrm{_{BH}} ρσ^{-3}$). We find that TDEs can lead a 300 $\mathrm{M_{\odot}}$ BH to reach $\sim 7000 \mathrm{M_{\odot}}$ within a Gyr. This indicates that TDEs can drive the formation and growth of massive BHs in sufficiently dense environments, which might be present in the central regions of nuclear star clusters.
△ Less
Submitted 23 November, 2022;
originally announced November 2022.
-
Modelling the accretion and feedback of supermassive black hole binaries in gas-rich galaxy mergers
Authors:
Shihong Liao,
Peter H. Johansson,
Matias Mannerkoski,
Dimitrios Irodotou,
Francesco Paolo Rizzuto,
Stuart McAlpine,
Antti Rantala,
Alexander Rawlings,
Till Sawala
Abstract:
We introduce a new model for the accretion and feedback of supermassive black hole (SMBH) binaries to the KETJU code, which enables us to resolve the evolution of SMBH binaries down to separations of tens of Schwarzschild radii in gas-rich galaxy mergers. Our subgrid binary accretion model extends the widely used Bondi--Hoyle--Lyttleton accretion into the binary phase and incorporates preferential…
▽ More
We introduce a new model for the accretion and feedback of supermassive black hole (SMBH) binaries to the KETJU code, which enables us to resolve the evolution of SMBH binaries down to separations of tens of Schwarzschild radii in gas-rich galaxy mergers. Our subgrid binary accretion model extends the widely used Bondi--Hoyle--Lyttleton accretion into the binary phase and incorporates preferential mass accretion onto the secondary SMBH, which is motivated by results from small-scale hydrodynamical circumbinary disc simulations. We perform idealised gas-rich disc galaxy merger simulations using pure thermal or pure kinetic active galactic nuclei (AGN) feedback. Our binary accretion model provides more physically motivated SMBH mass ratios, which are one of the key parameters for computing gravitational wave (GW) induced recoil velocities. The merger time-scales of our simulated SMBH binaries are in the range $t_{\rm merge}{\sim} 10$--$400$ Myr. Prograde in-plane equal-mass galaxy mergers lead to the shortest merger time-scales, as they experience the strongest starbursts, with the ensuing high stellar density resulting in a rapid SMBH coalescence. Compared to the thermal AGN feedback, the kinetic AGN feedback predicts longer merger time-scales and results in more core-like stellar profiles, as it is more effective in removing gas from the galaxy centre and quenching star formation. This suggests that the AGN feedback implementation plays a critical role in modelling SMBH coalescences. Our model will be useful for improving the modelling of SMBH mergers in gas-rich galaxies, the prime targets for the upcoming LISA GW observatory.
△ Less
Submitted 18 February, 2023; v1 submitted 21 November, 2022;
originally announced November 2022.
-
Baryonic Effects on Lagrangian Clustering and Angular Momentum Reconstruction
Authors:
Ming-Jie Sheng,
Hao-Ran Yu,
Sijia Li,
Shihong Liao,
Min Du,
Yunchong Wang,
Peng Wang,
Kun Xu,
Shy Genel,
Dimitrios Irodotou
Abstract:
Recent studies illustrate the correlation between the angular momenta of cosmic structures and their Lagrangian properties. However, only baryons are observable and it is unclear whether they reliably trace the cosmic angular momenta. We study the Lagrangian mass distribution, spin correlation, and predictability of dark matter, gas, and stellar components of galaxy-halo systems using IllustrisTNG…
▽ More
Recent studies illustrate the correlation between the angular momenta of cosmic structures and their Lagrangian properties. However, only baryons are observable and it is unclear whether they reliably trace the cosmic angular momenta. We study the Lagrangian mass distribution, spin correlation, and predictability of dark matter, gas, and stellar components of galaxy-halo systems using IllustrisTNG, and show that the primordial segregations between components are typically small. Their protoshapes are also similar in terms of the statistics of moment of inertia tensors. Under the common gravitational potential they are expected to exert the same tidal torque and the strong spin correlations are not destroyed by the nonlinear evolution and complicated baryonic effects, as confirmed by the high-resolution hydrodynamic simulations. We further show that their late-time angular momenta traced by total gas, stars, or the central galaxies, can be reliably reconstructed by the initial perturbations. These results suggest that baryonic angular momenta can potentially be used in reconstructing the parameters and models related to the initial perturbations.
△ Less
Submitted 4 February, 2023; v1 submitted 9 October, 2022;
originally announced October 2022.
-
Quadratic Constraints for Local Stability Analysis of Quadratic Systems
Authors:
Shih-Chi Liao,
Maziar S. Hemati,
Peter Seiler
Abstract:
This paper proposes new quadratic constraints (QCs) to bound a quadratic polynomial. Such QCs can be used in dissipation ineqaulities to analyze the stability and performance of nonlinear systems with quadratic vector fields. The proposed QCs utilize the sign-indefiniteness of certain classes of quadratic polynomials. These new QCs provide a tight bound on the quadratic terms along specific direct…
▽ More
This paper proposes new quadratic constraints (QCs) to bound a quadratic polynomial. Such QCs can be used in dissipation ineqaulities to analyze the stability and performance of nonlinear systems with quadratic vector fields. The proposed QCs utilize the sign-indefiniteness of certain classes of quadratic polynomials. These new QCs provide a tight bound on the quadratic terms along specific directions. This reduces the conservatism of the QC bounds as compared to the QCs in previous work. Two numerical examples of local stability analysis are provided to demonstrate the effectiveness of the proposed QCs.
△ Less
Submitted 8 September, 2022;
originally announced September 2022.
-
OSC Community Lab: The Integration Test Bed for O-RAN Software Community
Authors:
Fransiscus Asisi Bimo,
Ferlinda Feliana,
Shu-Hua Liao,
Chih-Wei Lin,
David F. Kinsey,
James Li,
Rittwik Jana,
Richard Wright,
Ray-Guang Cheng
Abstract:
O-RAN Software Community (OSC) is an open-source project collaborated by O-RAN Alliance and Linux Foundation, aiming to develop reference software components based on 3GPP and O-RAN Alliance specifications. The OSC has twelve projects. Among them, the Integration and Testing (INT) project is responsible for testing the requirements documented in each release for end-to-end and use case testing. Th…
▽ More
O-RAN Software Community (OSC) is an open-source project collaborated by O-RAN Alliance and Linux Foundation, aiming to develop reference software components based on 3GPP and O-RAN Alliance specifications. The OSC has twelve projects. Among them, the Integration and Testing (INT) project is responsible for testing the requirements documented in each release for end-to-end and use case testing. Three OSC Community Laboratories were built to speed up the integration and interoperability testing among different projects. This paper summarizes the software components developed by OSC projects and the status of the three OSC Community Laboratories. The activities of each laboratory, how the community collaborates, and the challenges we encountered along the way were elaborated.
△ Less
Submitted 31 August, 2022;
originally announced August 2022.
-
Large-scale influence of numerical noises as artificial stochastic disturbances on a sustained turbulence
Authors:
Shijie Qin,
Shijun Liao
Abstract:
We investigate the large-scale influence of numerical noises as tiny artificial stochastic disturbances on a sustained turbulence. Using the two-dimensional (2D) turbulent Rayleigh-Bénard (RB) convection as an example, we numerically solve the NS equations, separately, by means of a traditional algorithm with double precision (marked by RKwD) and the so-called clean numerical simulation (CNS). The…
▽ More
We investigate the large-scale influence of numerical noises as tiny artificial stochastic disturbances on a sustained turbulence. Using the two-dimensional (2D) turbulent Rayleigh-Bénard (RB) convection as an example, we numerically solve the NS equations, separately, by means of a traditional algorithm with double precision (marked by RKwD) and the so-called clean numerical simulation (CNS). The numerical simulation given by the RKwD is a mixture of the "true" physical solution and the "false" numerical noises that is random and can be regarded as a kind of artificial stochastic disturbances: unfortunately, the "true" physical solution is mostly at the same level as the "false" numerical noises. By contrast, the CNS can greatly reduce the background numerical noise to any a required level so that the "false" numerical noises are negligible compared with the "true" physical solution and thus the CNS solution can be used as a "clean" benchmark solution for comparison. It is found that the numerical noises as tiny artificial stochastic disturbances could indeed lead to large-scale deviations of simulations not only in spatio-temporal trajectories but also even in statistics. Especially, these numerical noises (as artificial stochastic disturbances) even lead to different types of flows: the shearing convection occurs for the RKwD simulations, and its corresponding flow field turns to a kind of zonal flow thereafter, however the CNS benchmark solution always sustains the non-shearing vortical/roll-like convection during the whole process of simulation. Thus, we provide a rigorous evidence that numerical noises as a kind of small-scale artificial stochastic disturbances have quantitatively and qualitatively large-scale influences on a sustained turbulence, i.e. the 2D turbulent RB convection considered in this paper.
△ Less
Submitted 19 August, 2022;
originally announced August 2022.
-
Avoiding small denominator problems by means of the homotopy analysis method
Authors:
Shijun Liao
Abstract:
The so-called ``small denominator problem'' was a fundamental problem of dynamics, as pointed out by Poincaré. Small denominators appear most commonly in perturbative theory. The Duffing equation is the simplest example of a non-integrable system exhibiting all problems due to small denominators. In this paper, using the forced Duffing equation as an example, we illustrate that the famous ``small…
▽ More
The so-called ``small denominator problem'' was a fundamental problem of dynamics, as pointed out by Poincaré. Small denominators appear most commonly in perturbative theory. The Duffing equation is the simplest example of a non-integrable system exhibiting all problems due to small denominators. In this paper, using the forced Duffing equation as an example, we illustrate that the famous ``small denominator problems'' never appear if a non-perturbative approach based on the homotopy analysis method (HAM), namely ``the method of directly defining inverse mapping'' (MDDiM), is used. The HAM-based MDDiM provides us great freedom to directly define the inverse operator of an undetermined linear operator so that all small denominators can be completely avoided and besides the convergent series of multiple limit-cycles of the forced Duffing equation with high nonlinearity are successfully obtained. So, from the viewpoint of the HAM, the famous ``small denominator problems'' are only artifacts of perturbation methods. Therefore, completely abandoning perturbation methods but using the HAM-based MDDiM, one would be never troubled by ``small denominators''. The HAM-based MDDiM has general meanings in mathematics and thus can be used to attack many open problems related to the so-called ``small denominators''.
△ Less
Submitted 10 January, 2023; v1 submitted 3 August, 2022;
originally announced August 2022.
-
Gaia Data Release 3: Summary of the content and survey properties
Authors:
Gaia Collaboration,
A. Vallenari,
A. G. A. Brown,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
C. Ducourant,
D. W. Evans,
L. Eyer,
R. Guerra,
A. Hutton,
C. Jordi,
S. A. Klioner,
U. L. Lammers,
L. Lindegren,
X. Luri,
F. Mignard,
C. Panem,
D. Pourbaix,
S. Randich,
P. Sartoretti,
C. Soubiran
, et al. (431 additional authors not shown)
Abstract:
We present the third data release of the European Space Agency's Gaia mission, GDR3. The GDR3 catalogue is the outcome of the processing of raw data collected with the Gaia instruments during the first 34 months of the mission by the Gaia Data Processing and Analysis Consortium. The GDR3 catalogue contains the same source list, celestial positions, proper motions, parallaxes, and broad band photom…
▽ More
We present the third data release of the European Space Agency's Gaia mission, GDR3. The GDR3 catalogue is the outcome of the processing of raw data collected with the Gaia instruments during the first 34 months of the mission by the Gaia Data Processing and Analysis Consortium. The GDR3 catalogue contains the same source list, celestial positions, proper motions, parallaxes, and broad band photometry in the G, G$_{BP}$, and G$_{RP}$ pass-bands already present in the Early Third Data Release. GDR3 introduces an impressive wealth of new data products. More than 33 million objects in the ranges $G_{rvs} < 14$ and $3100 <T_{eff} <14500 $, have new determinations of their mean radial velocities based on data collected by Gaia. We provide G$_{rvs}$ magnitudes for most sources with radial velocities, and a line broadening parameter is listed for a subset of these. Mean Gaia spectra are made available to the community. The GDR3 catalogue includes about 1 million mean spectra from the radial velocity spectrometer, and about 220 million low-resolution blue and red prism photometer BPRP mean spectra. The results of the analysis of epoch photometry are provided for some 10 million sources across 24 variability types. GDR3 includes astrophysical parameters and source class probabilities for about 470 million and 1500 million sources, respectively, including stars, galaxies, and quasars. Orbital elements and trend parameters are provided for some $800\,000$ astrometric, spectroscopic and eclipsing binaries. More than $150\,000$ Solar System objects, including new discoveries, with preliminary orbital solutions and individual epoch observations are part of this release. Reflectance spectra derived from the epoch BPRP spectral data are published for about 60\,000 asteroids. Finally, an additional data set is provided, namely the Gaia Andromeda Photometric Survey (abridged)
△ Less
Submitted 30 July, 2022;
originally announced August 2022.
-
Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning
Authors:
Xingping Dong,
Tianran Ouyang,
Shengcai Liao,
Bo Du,
Ling Shao
Abstract:
Most existing few-shot learning (FSL) methods require a large amount of labeled data in meta-training, which is a major limit. To reduce the requirement of labels, a semi-supervised meta-training (SSMT) setting has been proposed for FSL, which includes only a few labeled samples and numbers of unlabeled samples in base classes. However, existing methods under this setting require class-aware sampl…
▽ More
Most existing few-shot learning (FSL) methods require a large amount of labeled data in meta-training, which is a major limit. To reduce the requirement of labels, a semi-supervised meta-training (SSMT) setting has been proposed for FSL, which includes only a few labeled samples and numbers of unlabeled samples in base classes. However, existing methods under this setting require class-aware sample selection from the unlabeled set, which violates the assumption of unlabeled set. In this paper, we propose a practical semi-supervised meta-training setting with truly unlabeled data to facilitate the applications of FSL in realistic scenarios. To better utilize both the labeled and truly unlabeled data, we propose a simple and effective meta-training framework, called pseudo-labeling based meta-learning (PLML). Firstly, we train a classifier via common semi-supervised learning (SSL) and use it to obtain the pseudo-labels of unlabeled data. Then we build few-shot tasks from labeled and pseudo-labeled data and design a novel finetuning method with feature smoothing and noise suppression to better learn the FSL model from noise labels. Surprisingly, through extensive experiments across two FSL datasets, we find that this simple meta-training framework effectively prevents the performance degradation of various FSL models under limited labeled data, and also significantly outperforms the state-of-the-art SSMT models. Besides, benefiting from meta-training, our method also improves two representative SSL algorithms as well.
△ Less
Submitted 17 May, 2025; v1 submitted 14 July, 2022;
originally announced July 2022.
-
RePFormer: Refinement Pyramid Transformer for Robust Facial Landmark Detection
Authors:
Jinpeng Li,
Haibo Jin,
Shengcai Liao,
Ling Shao,
Pheng-Ann Heng
Abstract:
This paper presents a Refinement Pyramid Transformer (RePFormer) for robust facial landmark detection. Most facial landmark detectors focus on learning representative image features. However, these CNN-based feature representations are not robust enough to handle complex real-world scenarios due to ignoring the internal structure of landmarks, as well as the relations between landmarks and context…
▽ More
This paper presents a Refinement Pyramid Transformer (RePFormer) for robust facial landmark detection. Most facial landmark detectors focus on learning representative image features. However, these CNN-based feature representations are not robust enough to handle complex real-world scenarios due to ignoring the internal structure of landmarks, as well as the relations between landmarks and context. In this work, we formulate the facial landmark detection task as refining landmark queries along pyramid memories. Specifically, a pyramid transformer head (PTH) is introduced to build both homologous relations among landmarks and heterologous relations between landmarks and cross-scale contexts. Besides, a dynamic landmark refinement (DLR) module is designed to decompose the landmark regression into an end-to-end refinement procedure, where the dynamically aggregated queries are transformed to residual coordinates predictions. Extensive experimental results on four facial landmark detection benchmarks and their various subsets demonstrate the superior performance and high robustness of our framework.
△ Less
Submitted 8 July, 2022;
originally announced July 2022.
-
A physical perturbation based study on the prediction of free-fall disks with chaotic modes in the water
Authors:
Tianzhuang Xu,
Bo Zhang,
Jing Li,
Zhihui Li,
Shijuan Liao
Abstract:
We report a phenomenon that physical perturbations sometimes can benefit the certainty of a free-fall motion with chaotic modes, albeit, as commonly believed, they can ruin it. We statistically compare those factors that may lead to uncertainty, by which we find that the growth of the standard deviation of the landing locations is directly determined by the physical perturbations. A significant ya…
▽ More
We report a phenomenon that physical perturbations sometimes can benefit the certainty of a free-fall motion with chaotic modes, albeit, as commonly believed, they can ruin it. We statistically compare those factors that may lead to uncertainty, by which we find that the growth of the standard deviation of the landing locations is directly determined by the physical perturbations. A significant yardstick is defined in the meantime. This temporal criterion is of big relevance to the replicability of such problems experimentally, although they are inherently chaotic. Our hypothesis is verified by experiments from other literature. This outcome also provides a practical strategy to evaluate the credible prediction time by estimating the disturbances from physical parameters as a priori.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Reflectance spectra of Solar System small bodies
Authors:
Gaia Collaboration,
L. Galluccio,
M. Delbo,
F. De Angeli,
T. Pauwels,
P. Tanga,
F. Mignard,
A. Cellino,
A. G. A. Brown,
K. Muinonen,
A. Penttila,
S. Jordan,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
C. Ducourant,
D. W. Evans,
L. Eyer,
R. Guerra,
A. Hutton,
C. Jordi
, et al. (422 additional authors not shown)
Abstract:
The Gaia mission of the European Space Agency (ESA) has been routinely observing Solar System objects (SSOs) since the beginning of its operations in August 2014. The Gaia data release three (DR3) includes, for the first time, the mean reflectance spectra of a selected sample of 60 518 SSOs, primarily asteroids, observed between August 5, 2014, and May 28, 2017. Each reflectance spectrum was deriv…
▽ More
The Gaia mission of the European Space Agency (ESA) has been routinely observing Solar System objects (SSOs) since the beginning of its operations in August 2014. The Gaia data release three (DR3) includes, for the first time, the mean reflectance spectra of a selected sample of 60 518 SSOs, primarily asteroids, observed between August 5, 2014, and May 28, 2017. Each reflectance spectrum was derived from measurements obtained by means of the Blue and Red photometers (BP/RP), which were binned in 16 discrete wavelength bands. We describe the processing of the Gaia spectral data of SSOs, explaining both the criteria used to select the subset of asteroid spectra published in Gaia DR3, and the different steps of our internal validation procedures. In order to further assess the quality of Gaia SSO reflectance spectra, we carried out external validation against SSO reflectance spectra obtained from ground-based and space-borne telescopes and available in the literature. For each selected SSO, an epoch reflectance was computed by dividing the calibrated spectrum observed by the BP/RP at each transit on the focal plane by the mean spectrum of a solar analogue. The latter was obtained by averaging the Gaia spectral measurements of a selected sample of stars known to have very similar spectra to that of the Sun. Finally, a mean of the epoch reflectance spectra was calculated in 16 spectral bands for each SSO. The agreement between Gaia mean reflectance spectra and those available in the literature is good for bright SSOs, regardless of their taxonomic spectral class. We identify an increase in the spectral slope of S-type SSOs with increasing phase angle. Moreover, we show that the spectral slope increases and the depth of the 1 um absorption band decreases for increasing ages of S-type asteroid families.
△ Less
Submitted 24 June, 2022;
originally announced June 2022.
-
Hybrid thermal modeling of additive manufacturing processes using physics-informed neural networks for temperature prediction and parameter identification
Authors:
Shuheng Liao,
Tianju Xue,
Jihoon Jeong,
Samantha Webster,
Kornel Ehmann,
Jian Cao
Abstract:
Understanding the thermal behavior of additive manufacturing (AM) processes is crucial for enhancing the quality control and enabling customized process design. Most purely physics-based computational models suffer from intensive computational costs and the need of calibrating unknown parameters, thus not suitable for online control and iterative design application. Data-driven models taking advan…
▽ More
Understanding the thermal behavior of additive manufacturing (AM) processes is crucial for enhancing the quality control and enabling customized process design. Most purely physics-based computational models suffer from intensive computational costs and the need of calibrating unknown parameters, thus not suitable for online control and iterative design application. Data-driven models taking advantage of the latest developed computational tools can serve as a more efficient surrogate, but they are usually trained over a large amount of simulation data and often fail to effectively use small but high-quality experimental data. In this work, we developed a hybrid physics-based data-driven thermal modeling approach of AM processes using physics-informed neural networks. Specifically, partially observed temperature data measured from an infrared camera is combined with the physics laws to predict full-field temperature history and to discover unknown material and process parameters. In the numerical and experimental examples, the effectiveness of adding auxiliary training data and using the pretrained model on training efficiency and prediction accuracy, as well as the ability to identify unknown parameters with partially observed data, are demonstrated. The results show that the hybrid thermal model can effectively identify unknown parameters and capture the full-field temperature accurately, and thus it has the potential to be used in iterative process design and real-time process control of AM.
△ Less
Submitted 18 January, 2023; v1 submitted 15 June, 2022;
originally announced June 2022.
-
ET White Paper: To Find the First Earth 2.0
Authors:
Jian Ge,
Hui Zhang,
Weicheng Zang,
Hongping Deng,
Shude Mao,
Ji-Wei Xie,
Hui-Gen Liu,
Ji-Lin Zhou,
Kevin Willis,
Chelsea Huang,
Steve B. Howell,
Fabo Feng,
Jiapeng Zhu,
Xinyu Yao,
Beibei Liu,
Masataka Aizawa,
Wei Zhu,
Ya-Ping Li,
Bo Ma,
Quanzhi Ye,
Jie Yu,
Maosheng Xiang,
Cong Yu,
Shangfei Liu,
Ming Yang
, et al. (142 additional authors not shown)
Abstract:
We propose to develop a wide-field and ultra-high-precision photometric survey mission, temporarily named "Earth 2.0 (ET)". This mission is designed to measure, for the first time, the occurrence rate and the orbital distributions of Earth-sized planets. ET consists of seven 30cm telescopes, to be launched to the Earth-Sun's L2 point. Six of these are transit telescopes with a field of view of 500…
▽ More
We propose to develop a wide-field and ultra-high-precision photometric survey mission, temporarily named "Earth 2.0 (ET)". This mission is designed to measure, for the first time, the occurrence rate and the orbital distributions of Earth-sized planets. ET consists of seven 30cm telescopes, to be launched to the Earth-Sun's L2 point. Six of these are transit telescopes with a field of view of 500 square degrees. Staring in the direction that encompasses the original Kepler field for four continuous years, this monitoring will return tens of thousands of transiting planets, including the elusive Earth twins orbiting solar-type stars. The seventh telescope is a 30cm microlensing telescope that will monitor an area of 4 square degrees toward the galactic bulge. This, combined with simultaneous ground-based KMTNet observations, will measure masses for hundreds of long-period and free-floating planets. Together, the transit and the microlensing telescopes will revolutionize our understandings of terrestrial planets across a large swath of orbital distances and free space. In addition, the survey data will also facilitate studies in the fields of asteroseismology, Galactic archeology, time-domain sciences, and black holes in binaries.
△ Less
Submitted 14 June, 2022;
originally announced June 2022.