-
Normal Scalar Curvature Inequality on a Class of Austere Submanifolds
Authors:
Jianquan Ge,
Ya Tao,
Yi Zhou
Abstract:
In this paper, we establish new normal scalar curvature inequalities on a class of austere submanifolds by proving sharper DDVV-type inequalities on associated austere subspaces. We also provide some examples of austere submanifolds in this class and point out one of them achieves the equality in our normal scalar curvature inequality everywhere. As a byproduct, we obtain a Simons-type gap theorem for closed austere submanifolds in unit spheres which belong to that class.
Submitted 23 October, 2024;
originally announced October 2024.
-
Dynamic Tuning of Single-Photon Emission in Monolayer WSe2 via Localized Strain Engineering
Authors:
Yi Yu,
Junyu Ge,
Manlin Luo,
In Cheol Seo,
Youngmin Kim,
John J. H. Eng,
Kunze Lu,
Tian-Ran Wei,
Seok Woo Lee,
Weibo Gao,
Hong Li,
Donguk Nam
Abstract:
Two-dimensional (2D) materials have emerged as promising candidates for next-generation integrated single-photon emitters (SPEs). However, significant variability in the emission energies of 2D SPEs presents a major challenge in producing identical single photons from different SPEs, which may become crucial for various quantum applications including quantum information processing. Although various approaches to dynamically tuning the emission energies of 2D SPEs have been developed to address the issue, the practical solution to matching multiple individual SPEs in a single 2D flake is still scarce. In this work, we demonstrate a precise emission energy tuning of individual SPEs in a WSe2 monolayer. Our approach utilizes localized strain fields near individual SPEs, which we control independently by adjusting the physical volume of an SU-8-based stressor layer via focused laser annealing. This technique allows continuous emission energy tuning of up to 15 meV while maintaining the qualities of SPEs. Additionally, we showcase the precise spectral alignment of three distinct SPEs in a single WSe2 monolayer to the same wavelength. The tunability of 2D SPEs represents a solid step towards the on-chip integrated photonics with 2D materials for quantum technologies.
Submitted 2 March, 2025; v1 submitted 23 October, 2024;
originally announced October 2024.
-
Deterministic formation of carbon-functionalized quantum emitters in hexagonal boron nitride
Authors:
Manlin Luo,
Junyu Ge,
Pengru Huang,
Yi Yu,
In Cheol Seo,
Kunze Lu,
Hao Sun,
Jian Kwang Tan,
Sejeong Kim,
Weibo Gao,
Hong Li,
Donguk Nam
Abstract:
Forming single-photon emitters (SPEs) in insulating hexagonal boron nitride (hBN) has sparked wide interest in quantum photonics. Despite significant progress, it remains challenging to deterministically create SPEs at precise locations with a specific type of element for creating defects. In this study, we present a straightforward approach to generate site-deterministic carbon-functionalized quantum emitters in hBN by harnessing ultrasonic nanoindentation. The obtained SPEs are high-quality and can be scaled up to large arrays in a single fabrication step. Comprehensive experimental analyses reveal that the insertion of carbon atoms into the hBN lattice is the source of the robust quantum emission. Complementary theoretical studies suggest possible candidates for the structural origin of the defects based on our experimental results. This rapid and scalable nanoindentation method provides a new way to create SPE arrays with specific types of atoms, enabling the comprehensive investigation of the origins and mechanics of SPE formation in two-dimensional (2D) materials and beyond.
Submitted 23 October, 2024;
originally announced October 2024.
-
Multimodal growth and development assessment model
Authors:
Ying Li,
Zichen Song,
Zijie Gong,
Sitan Huang,
Jiewei Ge
Abstract:
With socioeconomic development and growing public attention to health, the growth and development of children and adolescents has become an important indicator of national health. Accurate and timely assessment of children's growth and development has therefore become increasingly important. At the same time, global health inequalities, especially child malnutrition and stunting in developing countries, urgently require effective assessment tools for monitoring and intervention. In recent years, the rapid development of technologies such as big data, artificial intelligence, and cloud computing, and the cross-integration of multiple disciplines such as biomedicine, statistics, and computer science have promoted the rapid development of large-scale models for growth and development assessment. However, problems remain, such as overly narrow evaluation factors, inaccurate diagnostic results, and an inability to give accurate and reasonable recommendations. The multi-modal growth and development assessment model uses the public data set of RSNA (Radiological Society of North America) as the training set, and the data set of the Department of Pediatrics of Huaibei People's Hospital as the open source test set. The embedded ICL module enables the model to quickly adapt to and identify the task at hand, so that it considers multiple evaluation factors while giving accurate diagnostic results and reasonable medical recommendations, thereby addressing the above problems and promoting the development of the medical field.
Submitted 17 October, 2024;
originally announced October 2024.
-
Biases in galaxy spectral analysis from strong lensing differential magnification effect and correction methods
Authors:
Wenshuo Xu,
Dandan Xu,
Xinzhong Er,
Junqiang Ge
Abstract:
Strong gravitational lensing has significantly advanced the study of high-redshift galaxies, but the differential magnification effect inevitably introduces biases in the spectral analysis of source galaxies. This work investigates these biases using mock lensing systems from MaNGA survey data and IllustrisTNG simulations. We analyze the impact of lensing effect on several spectral properties, including stellar age, metallicity, H$α$ flux, and optical emission line ratios. Our results show significant biases in all properties after lensing. The values of quantities can be either over- or under-estimated, except for the consistently enhanced H$α$ flux. The bias varies with lensing configurations and always arises when part of the source galaxy falls into the strong lensing regime. We evaluate two correction methods to recover the intrinsic source properties: the average magnification factor ($\bar{μ}$) and full ray-tracing. While both methods reduce the overestimated H$α$ flux, the $\bar{μ}$ method shows a much larger discrepancy. For stellar population properties and emission line ratios, the $\bar{μ}$ method fails whereas the ray-tracing method proves effective. Applying these two methods to a statistical sample of mock systems further shows their strong dependence on lens modeling accuracy. As a demonstrative study, our results highlight the importance of spatially resolved spectroscopic observations and precise lens modeling for reconstructing spectra of strongly lensed galaxies. While our conclusions are based on a specific source and lens galaxy, further studies with a statistical sample of realistic mock lensing systems are needed for understanding any systematic differences between the two correction methods.
Submitted 17 January, 2025; v1 submitted 14 October, 2024;
originally announced October 2024.
-
Mid-infrared group-IV nanowire laser
Authors:
Youngmin Kim,
Simone Assali,
Junyu Ge,
Sebastian Koelling,
Manlin Luo,
Lu Luo,
Hyo-Jun Joo,
James Tan,
Xuncheng Shi,
Zoran Ikonic,
Hong Li,
Oussama Moutanabbir,
Donguk Nam
Abstract:
Semiconductor nanowires have shown great potential for enabling ultra-compact lasers for integrated photonics platforms. Despite the impressive progress in developing nanowire lasers, their integration into Si photonics platforms remains challenging largely due to the use of III-V and II-VI semiconductors as gain media. These materials not only have high material costs, but also require inherently complex integration with Si-based fabrication processing, increasing overall costs and thereby limiting their large-scale adoption. Furthermore, these material-based nanowire lasers rarely emit above 2 μm, which is a technologically important wavelength regime for various applications in imaging and quantum sensing. Recently, group-IV nanowires, particularly direct bandgap GeSn nanowires capable of emitting above 2 μm, have emerged as promising cost-effective gain media for Si-compatible nanowire lasers, but there has been no successful demonstration of lasing from this seemingly promising nanowire platform. Herein, we report the experimental observation of lasing above 2 μm from a single bottom-up grown GeSn nanowire. By harnessing strain engineering and optimized cavity designs simultaneously, the single GeSn nanowire achieves an amplified material gain that can sufficiently overcome minimized optical losses, resulting in a single-mode lasing with an ultra-low threshold of ~5.3 kW cm$^{-2}$. Our finding paves the way for all-group IV mid-infrared photonic-integrated circuits with compact Si-compatible lasers for on-chip classical and quantum sensing and free-space communication.
Submitted 13 February, 2025; v1 submitted 11 October, 2024;
originally announced October 2024.
-
Moving Faster and Reducing Risk: Using LLMs in Release Deployment
Authors:
Rui Abreu,
Vijayaraghavan Murali,
Peter C Rigby,
Chandra Maddila,
Weiyan Sun,
Jun Ge,
Kaavya Chinniah,
Audris Mockus,
Megh Mehta,
Nachiappan Nagappan
Abstract:
Release engineering has traditionally focused on continuously delivering features and bug fixes to users, but at a certain scale, it becomes impossible for a release engineering team to determine what should be released. At Meta's scale, the responsibility appropriately and necessarily falls back on the engineer writing and reviewing the code. To address this challenge, we developed models of diff risk scores (DRS) to determine how likely a diff is to cause a SEV, i.e., a severe fault that impacts end-users. Assuming that SEVs are only caused by diffs, a naive model could randomly gate X% of diffs from landing, which would automatically catch X% of SEVs on average. However, we aimed to build a model that can capture Y% of SEVs by gating X% of diffs, where Y >> X. By training the model on historical data on diffs that have caused SEVs in the past, we can predict the riskiness of an outgoing diff to cause a SEV. Diffs that are beyond a particular threshold of risk can then be gated. We have four types of gating: no gating (green), weekend gating (weekend), medium impact on end-users (yellow), and high impact on end-users (red). The input parameter for our models is the level of gating, and the outcome measure is the number of captured SEVs. Our research approaches include a logistic regression model, a BERT-based model, and generative LLMs. Our baseline regression model captures 18.7%, 27.9%, and 84.6% of SEVs while respectively gating the top 5% (weekend), 10% (yellow), and 50% (red) of risky diffs. The BERT-based model, StarBERT, only captures 0.61x, 0.85x, and 0.81x as many SEVs as the logistic regression for the weekend, yellow, and red gating zones, respectively. The generative LLMs, iCodeLlama-34B and iDiffLlama-13B, when risk-aligned, capture more SEVs than the logistic regression model in production: 1.40x, 1.52x, 1.05x, respectively.
Submitted 8 October, 2024;
originally announced October 2024.
-
Equivalence of pseudogap and pairing energy in a cuprate high-temperature superconductor
Authors:
Jiasen Niu,
Maialen Ortego Larrazabal,
Thomas Gozlinski,
Yudai Sato,
Koen M. Bastiaans,
Tjerk Benschop,
Jian-Feng Ge,
Yaroslav M. Blanter,
Genda Gu,
Ingmar Swart,
Milan P. Allan
Abstract:
The pseudogap stands out in the phase diagram of the cuprate high-temperature superconductors because its origin and relationship to superconductivity remain elusive. The origin of the pseudogap has been debated, with competing hypotheses attributing it to preformed electron pairs or local order, such as charge density waves. Here, we present unambiguous evidence supporting the pairing scenario, using local shot-noise spectroscopy measurements in Bi2Sr2CaCu2O8+δ. Our data demonstrates that the pseudogap energy coincides with the onset of electron pairing, and is spatially heterogeneous with values reaching up to 70 meV. Our results exclude a pure local order origin of the pseudogap, link the pseudogap to Cooper pair formation, and show that the limiting factor for higher Tc in cuprates is phase coherence.
Submitted 24 September, 2024;
originally announced September 2024.
-
Adaptive radar detection of subspace-based distributed target in power heterogeneous clutter
Authors:
Daipeng Xiao,
Weijian Liu,
Jun Liu,
Lingyan Dai,
Xueli Fang,
Jianjun Ge
Abstract:
This paper investigates the problem of adaptive detection of distributed targets in power heterogeneous clutter. In the considered scenario, all the data share the identical structure of clutter covariance matrix, but with varying and unknown power mismatches. To address this problem, we iteratively estimate all the unknowns, including the coordinate matrix of the target, the clutter covariance matrix, and the corresponding power mismatches, and propose three detectors based on the generalized likelihood ratio test (GLRT), Rao and the Wald tests. The results from simulated and real data both illustrate that the detectors based on GLRT and Rao test have higher probabilities of detection (PDs) than the existing competitors. Among them, the Rao test-based detector exhibits the best overall detection performance. We also analyze the impact of the target extended dimensions, the signal subspace dimensions, and the number of training samples on the detection performance. Furthermore, simulation experiments also demonstrate that the proposed detectors have a constant false alarm rate (CFAR) property for the structure of clutter covariance matrix.
Submitted 9 October, 2024; v1 submitted 21 September, 2024;
originally announced September 2024.
-
Rapid Automatic Multiple Moving Objects Detection Method Based on Feature Extraction from Images with Non-sidereal Tracking
Authors:
Lei Wang,
Xiaoming Zhang,
Chunhai Bai,
Haiwen Xie,
Juan Li,
Jiayi Ge,
Jianfeng Wang,
Xianqun Zeng,
Jiantao Sun,
Xiaojun Jiang
Abstract:
Optically observing and monitoring moving objects, both natural and artificial, is important to human space security. Non-sidereal tracking can improve the system's limiting magnitude for moving objects, which benefits the surveillance. However, images with non-sidereal tracking include complex backgrounds, as well as objects with different brightness and motion modes, posing a significant challenge for accurate multi-object detection in such images, especially in wide field of view (WFOV) telescope images. To achieve higher detection precision at higher speed, we propose a novel object detection method, which combines source feature extraction and a neural network. First, our method extracts object features from optical images such as centroid, shape, and flux. Then it conducts a naive labeling based on those features to distinguish moving objects from stars. After balancing the labeled data, we employ it to train a neural network aimed at creating a classification model for point-like and streak-like objects. Ultimately, based on the neural network model's classification outcomes, moving objects whose motion modes are consistent with those of the tracked objects are detected via track association, while objects with different motion modes are detected using morphological statistics. The validation, based on space object images captured in target tracking mode with the 1-meter telescope at Nanshan, Xinjiang Astronomical Observatory, demonstrates that our method achieves 94.72% detection accuracy with a false alarm rate of merely 5.02%, and a processing time of 0.66s per frame. Consequently, our method can rapidly and accurately detect objects with different motion modes from wide-field images with non-sidereal tracking.
Submitted 3 September, 2024;
originally announced September 2024.
-
RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models
Authors:
Junyao Ge,
Xu Zhang,
Yang Zheng,
Kaitai Guo,
Jimin Liang
Abstract:
Abundant, well-annotated multimodal data in remote sensing are pivotal for aligning complex visual remote sensing (RS) scenes with human language, enabling the development of specialized vision language models across diverse RS interpretation tasks. However, annotating RS images with rich linguistic semantics at scale demands expertise in RS and substantial human labor, making it costly and often impractical. In this study, we propose a workflow that leverages large language models (LLMs) to generate multimodal datasets with semantically rich captions at scale from plain OpenStreetMap (OSM) data for images sourced from the Google Earth Engine (GEE) platform. This approach facilitates the generation of paired remote sensing data and can be readily scaled up using openly available data. Within this framework, we present RSTeller, a multimodal dataset comprising over 1.3 million RS images, each accompanied by two descriptive captions. Extensive experiments demonstrate that RSTeller enhances the performance of multiple existing vision language models for RS scene understanding through continual pre-training. Our methodology significantly reduces the manual effort and expertise needed for annotating remote sensing imagery while democratizing access to high-quality annotated data. This advancement fosters progress in visual language modeling and encourages broader participation in remote sensing research and applications. The RSTeller dataset is available at https://github.com/SlytherinGe/RSTeller.
Submitted 16 April, 2025; v1 submitted 26 August, 2024;
originally announced August 2024.
-
ESA: Annotation-Efficient Active Learning for Semantic Segmentation
Authors:
Jinchao Ge,
Zeyu Zhang,
Minh Hieu Phan,
Bowen Zhang,
Akide Liu,
Yang Zhao
Abstract:
Active learning enhances annotation efficiency by selecting the most revealing samples for labeling, thereby reducing reliance on extensive human input. Previous methods in semantic segmentation have centered on individual pixels or small areas, neglecting the rich patterns in natural images and the power of advanced pre-trained models. To address these challenges, we propose three key contributions: Firstly, we introduce Entity-Superpixel Annotation (ESA), an innovative and efficient active learning strategy which utilizes a class-agnostic mask proposal network coupled with super-pixel grouping to capture local structural cues. Additionally, our method selects a subset of entities within each image of the target domain, prioritizing superpixels with high entropy to ensure comprehensive representation. Simultaneously, it focuses on a limited number of key entities, thereby optimizing for efficiency. By utilizing an annotator-friendly design that capitalizes on the inherent structure of images, our approach significantly outperforms existing pixel-based methods, achieving superior results with minimal queries, specifically reducing click cost by 98% and enhancing performance by 1.71%. For instance, our technique requires a mere 40 clicks for annotation, a stark contrast to the 5000 clicks demanded by conventional methods.
Submitted 24 August, 2024;
originally announced August 2024.
-
High quality epitaxial piezoelectric and ferroelectric wurtzite Al$_{1-x}$Sc$_x$N thin films
Authors:
Yang Zeng,
Yihan Lei,
Yanghe Wang,
Mingqiang Cheng,
Luocheng Liao,
Xuyang Wang,
Jinxin Ge,
Zhenghao Liu,
Wenjie Ming,
Chao Li,
Shuhong Xie,
Jiangyu Li,
Changjian Li
Abstract:
Piezoelectric and ferroelectric wurtzite materials promise to reshape modern microelectronics because they can be easily integrated with mainstream semiconductor technology. Sc-doped AlN (Al$_{1-x}$Sc$_x$N) has attracted much attention for its enhanced piezoelectric and emerging ferroelectric properties, yet the commonly used sputtering results in polycrystalline Al$_{1-x}$Sc$_x$N films with high leakage current. Here we report the pulsed laser deposition of single crystalline epitaxial Al$_{1-x}$Sc$_x$N thin films on sapphire and 4H-SiC substrates. Pure wurtzite phase is maintained up to $x = 0.3$ with minimal oxygen contamination. Polarization is estimated to be 140 $μ$C/cm$^2$ via atomic scale microscopy imaging and found to be switchable via a scanning probe. The piezoelectric coefficient is found to be 5 times that of undoped AlN when $x = 0.3$, making it desirable for high-frequency radiofrequency (RF) filters and three-dimensional nonvolatile memories.
Submitted 21 August, 2024;
originally announced August 2024.
-
Image-Based Geolocation Using Large Vision-Language Models
Authors:
Yi Liu,
Junchen Ding,
Gelei Deng,
Yuekang Li,
Tianwei Zhang,
Weisong Sun,
Yaowen Zheng,
Jingquan Ge,
Yang Liu
Abstract:
Geolocation is now a vital aspect of modern life, offering numerous benefits but also presenting serious privacy concerns. The advent of large vision-language models (LVLMs) with advanced image-processing capabilities introduces new risks, as these models can inadvertently reveal sensitive geolocation information. This paper presents the first in-depth study analyzing the challenges posed by traditional deep learning and LVLM-based geolocation methods. Our findings reveal that LVLMs can accurately determine geolocations from images, even without explicit geographic training.
To address these challenges, we introduce \tool{}, an innovative framework that significantly enhances image-based geolocation accuracy. \tool{} employs a systematic chain-of-thought (CoT) approach, mimicking human geoguessing strategies by carefully analyzing visual and contextual cues such as vehicle types, architectural styles, natural landscapes, and cultural elements. Extensive testing on a dataset of 50,000 ground-truth data points shows that \tool{} outperforms both traditional models and human benchmarks in accuracy. It achieves an impressive average score of 4550.5 in the GeoGuessr game, with an 85.37% win rate, and delivers highly precise geolocation predictions, with the closest distances as accurate as 0.3 km. Furthermore, our study highlights issues related to dataset integrity, leading to the creation of a more robust dataset and a refined framework that leverages LVLMs' cognitive capabilities to improve geolocation precision. These findings underscore \tool{}'s superior ability to interpret complex visual data, the urgent need to address emerging security vulnerabilities posed by LVLMs, and the importance of responsible AI development to ensure user privacy protection.
Submitted 18 August, 2024;
originally announced August 2024.
-
System States Forecasting of Microservices with Dynamic Spatio-Temporal Data
Authors:
Yifei Xu,
Jingguo Ge,
Haina Tang,
Shuai Ding,
Tong Li,
Hui Li
Abstract:
In the AIOps (Artificial Intelligence for IT Operations) era, accurately forecasting system states is crucial. In microservices systems, this task encounters the challenge of dynamic and complex spatio-temporal relationships among microservice instances, primarily due to dynamic deployments, diverse call paths, and cascading effects among instances. Current time-series forecasting methods, which focus mainly on intrinsic patterns, are insufficient in environments where spatial relationships are critical. Similarly, spatio-temporal graph approaches often neglect the nature of temporal trends, concentrating mostly on message passing between nodes. Moreover, current research in the microservices domain frequently underestimates the importance of network metrics and topological structures in capturing the evolving dynamics of systems. This paper introduces STMformer, a model tailored for forecasting system states in microservices environments, capable of handling multi-node and multivariate time series. Our method leverages dynamic network connection data and topological information to assist in modeling the intricate spatio-temporal relationships within the system. Additionally, we integrate the PatchCrossAttention module to compute the impact of cascading effects globally. We have developed a dataset based on a microservices system and conducted comprehensive experiments with STMformer against leading methods. In both short-term and long-term forecasting tasks, our model consistently achieved an 8.6% reduction in MAE (Mean Absolute Error) and a 2.2% reduction in MSE (Mean Squared Error). The source code is available at https://github.com/xuyifeiiie/STMformer.
Submitted 14 August, 2024;
originally announced August 2024.
-
Photometric properties of classical bulge and pseudo-bulge galaxies at $0.5\le z<1.0$
Authors:
Jia Hu,
Qifan Cui,
Lan Wang,
Wenxiang Pei,
Junqiang Ge
Abstract:
We compare the photometric properties and specific star formation rate (sSFR) of classical and pseudo-bulge galaxies with $M_* \ge 10^{9.5} \rm M_{\odot}$ at $0.5\le z<1.0$, selected from all five CANDELS fields. We also compare these properties with those of bulge galaxies at lower redshift selected from the MaNGA survey (Hu et al. 2024). This paper aims to study the properties of galaxies with classical and pseudo-bulges at intermediate redshift, to compare the differences between different bulge types, and to understand the evolution of bulges with redshift. Galaxies are classified into classical bulge and pseudo-bulge samples according to the Sérsic index n of the bulge component based on results of two-component decomposition of galaxies, as well as the position of bulges on the Kormendy diagram. For the 105 classical bulge and 86 pseudo-bulge galaxies selected, we compare the size, luminosity, and sSFR of their various components. At given stellar mass, most classical bulge galaxies have smaller effective radii, larger $B/T$, brighter and relatively larger bulges, and less active star formation than pseudo-bulge galaxies. Besides, the two types of galaxies have larger differences in sSFR at large radii than at the central region at both low and mid-redshifts. The differences between properties of the two types of bulge galaxies are in general smaller at mid-redshift than at low redshift, indicating that they are evolving to more distinct populations towards the local universe. Bulge type is correlated with the properties of their outer disks, and the correlation is already present at redshift as high as $0.5<z<1$.
Submitted 31 July, 2024;
originally announced July 2024.
-
Discovery of novel antimicrobial peptides with notable antibacterial potency by a LLM-based foundation model
Authors:
Jike Wang,
Jianwen Feng,
Yu Kang,
Peichen Pan,
Jingxuan Ge,
Yan Wang,
Mingyang Wang,
Zhenxing Wu,
Xingcai Zhang,
Jiameng Yu,
Xujun Zhang,
Tianyue Wang,
Lirong Wen,
Guangning Yan,
Yafeng Deng,
Hui Shi,
Chang-Yu Hsieh,
Zhihui Jiang,
Tingjun Hou
Abstract:
Large language models (LLMs) have shown remarkable advancements in chemistry and biomedical research, acting as versatile foundation models for various tasks. We introduce AMP-Designer, an LLM-based approach for swiftly designing novel antimicrobial peptides (AMPs) with desired properties. Within 11 days, AMP-Designer achieved the de novo design of 18 AMPs with broad-spectrum activity against Gram-negative bacteria. In vitro validation revealed a 94.4% success rate, with two candidates demonstrating exceptional antibacterial efficacy, minimal hemotoxicity, stability in human plasma, and low potential to induce resistance, as evidenced by significant bacterial load reduction in murine lung infection experiments. The entire process, from design to validation, concluded in 48 days. AMP-Designer excels in creating AMPs targeting specific strains despite limited data availability, with a top candidate displaying a minimum inhibitory concentration of 2.0 μg/ml against Propionibacterium acnes. Integrating advanced machine learning techniques, AMP-Designer demonstrates remarkable efficiency, paving the way for innovative solutions to antibiotic resistance.
Submitted 2 March, 2025; v1 submitted 16 July, 2024;
originally announced July 2024.
-
Constraining Weyl type f(Q,T) gravity with Big Bang Nucleosynthesis
Authors:
Jian Ge,
Lei Ming,
Shi-Dong Liang,
Hong-Hao Zhang,
Tiberiu Harko
Abstract:
The Weyl type $f(Q,T)$ modified gravity theory is an extension of the $f(Q)$ and $f(Q,T)$ type theories, where $T$ is the trace of the matter energy-momentum tensor, and the scalar non-metricity $Q$ is represented in its standard Weyl form and is fully determined by a vector field $ω_μ$. The theory can give a good description of the observational data and of the evolution of the late-time Universe, including a geometric explanation of dark energy. In this work we investigate the Big Bang Nucleosynthesis (BBN) constraints on several Weyl type $f(Q,T)$ gravity models. In particular, we consider the corrections that Weyl type $f(Q,T)$ terms induce on the freeze-out temperature $\mathcal{T}_f$, as compared to the standard $Λ$CDM results. We analyze in detail three distinct cosmological models, corresponding to specific choices of the functional form of $f(Q,T)$. The first model has a simple linear additive structure in $Q$ and $T$, the second model is multiplicative in $Q$ and $T$, while the third is additive in $T$ and the exponential of $Q$. For each $f(Q,T)$ we first consider the cosmological evolution in the radiation-dominated era, and then we impose the observational bound on $\left|δ\mathcal{T}_f/ \mathcal{T}_f\right|$ to obtain constraints on the model parameters from the primordial abundances of light elements such as helium-4, deuterium and lithium-7. The abundances of helium-4 and deuterium agree with theoretical predictions; however, the lithium problem, although slightly alleviated, still persists for the considered Weyl type $f(Q,T)$ models. Generally, these models satisfy the BBN constraints, and thus they represent viable cosmologies describing the entire dynamical time scale of the evolution of the Universe.
Submitted 14 July, 2024;
originally announced July 2024.
-
High-Resolution Cloud Detection Network
Authors:
Jingsheng Li,
Tianxiang Xue,
Jiayi Zhao,
Jingmin Ge,
Yufang Min,
Wei Su,
Kun Zhan
Abstract:
The complexity of clouds, particularly in terms of texture detail at high resolutions, has not been well explored by most existing cloud detection networks. This paper introduces the High-Resolution Cloud Detection Network (HR-cloud-Net), which utilizes a hierarchical high-resolution integration approach. HR-cloud-Net integrates a high-resolution representation module, layer-wise cascaded feature fusion module, and multi-resolution pyramid pooling module to effectively capture complex cloud features. This architecture preserves detailed cloud texture information while facilitating feature exchange across different resolutions, thereby enhancing overall performance in cloud detection. Additionally, a novel approach is introduced wherein a student view, trained on noisy augmented images, is supervised by a teacher view processing normal images. This setup enables the student to learn from cleaner supervisions provided by the teacher, leading to improved performance. Extensive evaluations on three optical satellite image cloud detection datasets validate the superior performance of HR-cloud-Net compared to existing methods. The source code is available at https://github.com/kunzhan/HR-cloud-Net.
Submitted 10 July, 2024;
originally announced July 2024.
-
Training Task Experts through Retrieval Based Distillation
Authors:
Jiaxin Ge,
Xueying Jia,
Vijay Viswanathan,
Hongyin Luo,
Graham Neubig
Abstract:
One of the most reliable ways to create deployable models for specialized tasks is to obtain an adequate amount of high-quality task-specific data. However, for specialized tasks, such datasets often do not exist. Existing methods address this by creating such data from large language models (LLMs) and then distilling such knowledge into smaller models. However, these methods are limited by the quality of the LLMs' output, and tend to generate repetitive or incorrect data. In this work, we present Retrieval Based Distillation (ReBase), a method that first retrieves data from rich online sources and then transforms them into domain-specific data. This method greatly enhances data diversity. Moreover, ReBase generates Chain-of-Thought reasoning and distills the reasoning capacity of LLMs. We test our method on 4 benchmarks and results show that our method significantly improves performance by up to 7.8% on SQuAD, 1.37% on MNLI, and 1.94% on BigBench-Hard.
Submitted 7 July, 2024;
originally announced July 2024.
-
Machine Learning for Economic Forecasting: An Application to China's GDP Growth
Authors:
Yanqing Yang,
Xingcheng Xu,
Jinfeng Ge,
Yan Xu
Abstract:
This paper aims to explore the application of machine learning in forecasting Chinese macroeconomic variables. Specifically, it employs various machine learning models to predict the quarterly real GDP growth of China, and analyzes the factors contributing to the performance differences among these models. Our findings indicate that the average forecast errors of machine learning models are generally lower than those of traditional econometric models or expert forecasts, particularly in periods of economic stability. However, during certain inflection points, although machine learning models still outperform traditional econometric models, expert forecasts may exhibit greater accuracy in some instances due to experts' more comprehensive understanding of the macroeconomic environment and real-time economic variables. In addition to macroeconomic forecasting, this paper employs interpretable machine learning methods to identify the key attributive variables from different machine learning models, aiming to enhance the understanding and evaluation of their contributions to macroeconomic fluctuations.
Submitted 3 July, 2024;
originally announced July 2024.
-
InternLM-Law: An Open Source Chinese Legal Large Language Model
Authors:
Zhiwei Fei,
Songyang Zhang,
Xiaoyu Shen,
Dawei Zhu,
Xiao Wang,
Maosong Cao,
Fengzhe Zhou,
Yining Li,
Wenwei Zhang,
Dahua Lin,
Kai Chen,
Jidong Ge
Abstract:
While large language models (LLMs) have showcased impressive capabilities, they struggle with addressing legal queries due to the intricate complexities and specialized expertise required in the legal field. In this paper, we introduce InternLM-Law, a specialized LLM tailored for addressing diverse legal queries related to Chinese laws, spanning from responding to standard legal questions (e.g., legal exercises in textbooks) to analyzing complex real-world legal situations. We meticulously construct a dataset in the Chinese legal domain, encompassing over 1 million queries, and implement a data filtering and processing pipeline to ensure its diversity and quality. Our training approach involves a novel two-stage process: initially fine-tuning LLMs on both legal-specific and general-purpose content to equip the models with broad knowledge, followed by exclusive fine-tuning on high-quality legal data to enhance structured output generation. InternLM-Law achieves the highest average performance on LawBench, outperforming state-of-the-art models, including GPT-4, on 13 out of 20 subtasks. We make InternLM-Law and our dataset publicly available to facilitate future research in applying LLMs within the legal domain.
Submitted 21 June, 2024;
originally announced June 2024.
-
Parameter-Inverted Image Pyramid Networks
Authors:
Xizhou Zhu,
Xue Yang,
Zhaokai Wang,
Hao Li,
Wenhan Dou,
Junqi Ge,
Lewei Lu,
Yu Qiao,
Jifeng Dai
Abstract:
Image pyramids are commonly used in modern computer vision tasks to obtain multi-scale features for precise understanding of images. However, image pyramids process multiple resolutions of images using the same large-scale model, which requires significant computational cost. To overcome this issue, we propose a novel network architecture known as the Parameter-Inverted Image Pyramid Networks (PIIP). Our core idea is to use models with different parameter sizes to process different resolution levels of the image pyramid, thereby balancing computational efficiency and performance. Specifically, the input to PIIP is a set of multi-scale images, where higher resolution images are processed by smaller networks. We further propose a feature interaction mechanism to allow features of different resolutions to complement each other and effectively integrate information from different spatial scales. Extensive experiments demonstrate that the PIIP achieves superior performance in tasks such as object detection, segmentation, and image classification, compared to traditional image pyramid methods and single-branch networks, while reducing computational cost. Notably, when applying our method on a large-scale vision foundation model InternViT-6B, we improve its performance by 1%-2% on detection and segmentation with only 40%-60% of the original computation. These results validate the effectiveness of the PIIP approach and provide a new technical direction for future vision computing tasks. Our code and models are available at https://github.com/OpenGVLab/PIIP.
Submitted 28 October, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
Securing Equal Share: A Principled Approach for Learning Multiplayer Symmetric Games
Authors:
Jiawei Ge,
Yuanhao Wang,
Wenzhe Li,
Chi Jin
Abstract:
This paper examines multiplayer symmetric constant-sum games with more than two players in a competitive setting, including examples like Mahjong, Poker, and various board and video games. In contrast to two-player zero-sum games, equilibria in multiplayer games are neither unique nor non-exploitable, failing to provide meaningful guarantees when competing against opponents who play different equilibria or non-equilibrium strategies. This gives rise to a series of long-lasting fundamental questions in multiplayer games regarding suitable objectives, solution concepts, and principled algorithms. This paper takes an initial step towards addressing these challenges by focusing on the natural objective of equal share -- securing an expected payoff of C/n in an n-player symmetric game with a total payoff of C. We rigorously identify the theoretical conditions under which achieving an equal share is tractable and design a series of efficient algorithms, inspired by no-regret learning, that provably attain approximate equal share across various settings. Furthermore, we provide complementary lower bounds that justify the sharpness of our theoretical results. Our experimental results highlight worst-case scenarios where meta-algorithms from prior state-of-the-art systems for multiplayer games fail to secure an equal share, while our algorithm succeeds, demonstrating the effectiveness of our approach.
Submitted 2 October, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
A Self-Correcting Vision-Language-Action Model for Fast and Slow System Manipulation
Authors:
Chenxuan Li,
Jiaming Liu,
Guanqun Wang,
Xiaoqi Li,
Sixiang Chen,
Liang Heng,
Chuyan Xiong,
Jiaxin Ge,
Renrui Zhang,
Kaichen Zhou,
Shanghang Zhang
Abstract:
Recently, some studies have integrated Multimodal Large Language Models (MLLMs) into robotic manipulation, constructing vision-language-action models (VLAs) to interpret multimodal information and predict SE(3) poses. While VLAs have shown promising progress, they may suffer from failures when faced with novel and complex tasks. To emulate human-like reasoning for more robust manipulation, we propose the self-corrected (SC-)VLA framework, which integrates a fast system for directly predicting actions and a slow system for reflecting on failed actions within a single VLA policy. For the fast system, we incorporate parameter-efficient fine-tuning to equip the model with pose prediction capabilities while preserving the inherent reasoning abilities of MLLMs. For the slow system, we propose a Chain-of-Thought training strategy for failure correction, designed to mimic human reflection after a manipulation failure. Specifically, our model learns to identify the causes of action failures, adaptively seek expert feedback, reflect on the current failure scenario, and iteratively generate corrective actions, step by step. Furthermore, a continuous policy learning method is designed based on successfully corrected samples, enhancing the fast system's adaptability to the current configuration. We compare SC-VLA with the previous SOTA VLA in both simulation and real-world tasks, demonstrating an efficient correction process and improved manipulation accuracy on both seen and unseen tasks.
Submitted 18 March, 2025; v1 submitted 27 May, 2024;
originally announced May 2024.
-
Optimal Aggregation of Prediction Intervals under Unsupervised Domain Shift
Authors:
Jiawei Ge,
Debarghya Mukherjee,
Jianqing Fan
Abstract:
As machine learning models are increasingly deployed in dynamic environments, it becomes paramount to assess and quantify uncertainties associated with distribution shifts. A distribution shift occurs when the underlying data-generating process changes, leading to a deviation in the model's performance. The prediction interval, which captures the range of likely outcomes for a given prediction, serves as a crucial tool for characterizing uncertainties induced by their underlying distribution. In this paper, we propose methodologies for aggregating prediction intervals to obtain one with minimal width and adequate coverage on the target domain under unsupervised domain shift, under which we have labeled samples from a related source domain and unlabeled covariates from the target domain. Our analysis encompasses scenarios where the source and the target domain are related via i) a bounded density ratio, and ii) a measure-preserving transformation. Our proposed methodologies are computationally efficient and easy to implement. Beyond illustrating the performance of our method through real-world datasets, we also delve into the theoretical details. This includes establishing rigorous theoretical guarantees, coupled with finite sample bounds, regarding the coverage and width of our prediction intervals. Our approach excels in practical applications and is underpinned by a solid theoretical framework, ensuring its reliability and effectiveness across diverse contexts.
Submitted 7 October, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.
-
Communication-Efficient Collaborative Perception via Information Filling with Codebook
Authors:
Yue Hu,
Juntong Peng,
Sifei Liu,
Junhao Ge,
Si Liu,
Siheng Chen
Abstract:
Collaborative perception empowers each agent to improve its perceptual ability through the exchange of perceptual messages with other agents. It inherently results in a fundamental trade-off between perception ability and communication cost. To address this bottleneck issue, our core idea is to optimize the collaborative messages from two key aspects: representation and selection. The proposed codebook-based message representation enables the transmission of integer codes, rather than high-dimensional feature maps. The proposed information-filling-driven message selection optimizes local messages to collectively fill each agent's information demand, preventing information overflow among multiple agents. By integrating these two designs, we propose CodeFilling, a novel communication-efficient collaborative perception system, which significantly advances the perception-communication trade-off and is inclusive to both homogeneous and heterogeneous collaboration settings. We evaluate CodeFilling in both a real-world dataset, DAIR-V2X, and a new simulation dataset, OPV2VH+. Results show that CodeFilling outperforms previous SOTA Where2comm on DAIR-V2X/OPV2VH+ with 1,333/1,206 times lower communication volume. Our code is available at https://github.com/PhyllisH/CodeFilling.
Submitted 8 May, 2024;
originally announced May 2024.
-
Life-long Learning and Testing for Automated Vehicles via Adaptive Scenario Sampling as A Continuous Optimization Process
Authors:
Jingwei Ge,
Pengbo Wang,
Cheng Chang,
Yi Zhang,
Danya Yao,
Li Li
Abstract:
Sampling critical testing scenarios is an essential step in intelligence testing for Automated Vehicles (AVs). However, due to the lack of prior knowledge of the distribution of critical scenarios in the sampling space, it is difficult to find the critical scenarios efficiently or to evaluate the intelligence of AVs accurately. To solve this problem, we formulate the testing as a continuous optimization process which iteratively generates potential critical scenarios and simultaneously evaluates them. A bi-level loop is proposed for such life-long learning and testing. In the outer loop, we iteratively learn space knowledge by evaluating the AV in the already sampled scenarios and then sample new scenarios based on the retained knowledge. The outer loop stops when the generated samples cover the whole space. To maximize the coverage of the space in each outer loop, we set up an inner loop which receives the samples newly generated in the outer loop and outputs their updated positions. We assume that points in a small sphere-like subspace can be covered (or represented) by the point at the center of this sphere. Therefore, we can apply a multi-round heuristic strategy to move and pack these spheres in space to find the best covering solution. The simulation results show that faster and more accurate evaluation of AVs can be achieved with more critical scenarios.
Submitted 28 March, 2024;
originally announced May 2024.
-
Towards Symbiotic SAGIN Through Inter-operator Resource and Service Sharing: Joint Orchestration of User Association and Radio Resources
Authors:
Shizhao He,
Jungang Ge,
Ying-Chang Liang,
Dusit Niyato
Abstract:
The space-air-ground integrated network (SAGIN) is a pivotal architecture to support ubiquitous connectivity in the upcoming 6G era. Inter-operator resource and service sharing is a promising way to realize such a huge network, utilizing resources efficiently and reducing construction costs. Given the rationality of operators, the configuration of resources and services in SAGIN should focus on both the overall system performance and individual benefits of operators. Motivated by emerging symbiotic communication facilitating mutual benefits across different radio systems, we investigate the resource and service sharing in SAGIN from a symbiotic communication perspective in this paper. In particular, we consider a SAGIN consisting of a ground network operator (GNO) and a satellite network operator (SNO). Specifically, we aim to maximize the weighted sum rate (WSR) of the whole SAGIN by jointly optimizing the user association, resource allocation, and beamforming. Besides, we introduce a sharing coefficient to characterize the revenue of operators. Operators may suffer revenue loss when only focusing on maximizing the WSR. In pursuit of mutual benefits, we propose a mutual benefit constraint (MBC) to ensure that each operator obtains revenue gains. Then, we develop a centralized algorithm based on the successive convex approximation (SCA) method. Considering that the centralized algorithm is difficult to implement, we propose a distributed algorithm based on Lagrangian dual decomposition and the consensus alternating direction method of multipliers (ADMM). Finally, we provide extensive numerical simulations to demonstrate the effectiveness of the two proposed algorithms, and the distributed optimization algorithm can approach the performance of the centralized one.
Submitted 25 April, 2024;
originally announced April 2024.
-
Reconstructing Intrinsic Stellar Noise with Stellar Atmospheric Parameters and Chromospheric Activity
Authors:
Jinghua Zhang,
Maosheng Xiang,
Jie Yu,
Jian Ge,
Ji-Wei Xie,
Hui Zhang,
Yaguang Li,
You Wu,
Chun-Qian Li,
Shaolan Bi,
Hong-Liang Yan,
Jian-Rong Shi
Abstract:
Accurately characterizing intrinsic stellar photometric noise induced by stellar astrophysics, such as stellar activity, granulation, and oscillations, is of crucial importance for detecting transiting exoplanets. In this study, we investigate the relation between the intrinsic stellar photometric noise, as quantified by the Kepler rrmsCDPP measurement, and the level of stellar chromospheric activity, as indicated by the S-index of Ca II HK lines derived from the LAMOST spectra. Our results reveal a clear positive correlation between S-index and rrmsCDPP, and the correlation becomes more significant at higher activity levels and on longer timescales. We have therefore built an empirical relation between rrmsCDPP and S-index as well as Teff, logg, [Fe/H], and apparent magnitude with the XGBoost regression algorithm, using the LAMOST-Kepler common star sample as the training set. This method achieves a precision of ~20 ppm for inferring the intrinsic noise from the S-index and other stellar labels on a 6-hour integration duration. We have applied this empirical relation to the full LAMOST DR7 spectra database, and obtained the intrinsic noise predictions for 1,358,275 stars. The resultant catalog is publicly available and expected to be valuable for optimizing target selection for future exoplanet-hunting space missions, such as the Earth 2.0 mission.
Submitted 21 April, 2024;
originally announced April 2024.
-
Can near-to-mid infrared spectral energy distribution quantitatively trace protoplanetary disk evolution?
Authors:
Mingchao Liu,
Jinhua He,
Zhen Guo,
Jixing Ge,
Yuping Tang
Abstract:
Infrared (IR) spectral energy distribution (SED) is the major tracer of protoplanetary disks. It was recently proposed to use the near-to-mid IR (or K-24) SED slope $α$ defined between 2-24$μ$m as a potential quantitative tracer of disk age. We critically examine the viability of this idea and confront it with additional statistics of IR luminosities and SED shapes. We point out that, because the statistical properties of most of the complicated physical factors involved in disk evolution are still poorly understood in a quantitative sense, the only viable way is to assume them to be random so that an idealized `average disk' can be defined, which allows the $α$ histogram to trace its age. We confirm that the statistics of the zeroth order (luminosity), first order (slope $α$) and second order characteristics (concavity) of the observed K-24 SEDs indeed carry useful information upon the evolutionary processes of the `average disk'. We also stress that intrinsic diversities in K-24 SED shapes and luminosities are always large at the level of individual stars so that the application of the evolutionary path of the `average disk' to individual stars must be done with care. The data of most curves in plots are provided on GitHub.
Submitted 11 May, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System
Authors:
Genjia Liu,
Yue Hu,
Chenxin Xu,
Weibo Mao,
Junhao Ge,
Zhengxiang Huang,
Yifan Lu,
Yinda Xu,
Junkai Xia,
Yafei Wang,
Siheng Chen
Abstract:
Vehicle-to-everything-aided autonomous driving (V2X-AD) has a huge potential to provide a safer driving solution. Despite extensive research in transportation and communication to support V2X-AD, the actual utilization of these infrastructures and communication resources in enhancing driving performance remains largely unexplored. This highlights the necessity of collaborative autonomous driving: a machine learning approach that optimizes the information sharing strategy to improve the driving performance of each vehicle. This effort necessitates two key foundations: a platform capable of generating data to facilitate the training and testing of V2X-AD, and a comprehensive system that integrates full driving-related functionalities with mechanisms for information sharing. From the platform perspective, we present V2Xverse, a comprehensive simulation platform for collaborative autonomous driving. This platform provides a complete pipeline for collaborative driving. From the system perspective, we introduce CoDriving, a novel end-to-end collaborative driving system that properly integrates V2X communication over the entire autonomous pipeline, promoting driving with shared perceptual information. The core idea is a novel driving-oriented communication strategy. Leveraging this strategy, CoDriving improves driving performance while optimizing communication efficiency. We conduct comprehensive benchmarks with V2Xverse, analyzing both modular performance and closed-loop driving performance. Experimental results show that CoDriving: i) significantly improves the driving score by 62.49% and drastically reduces the pedestrian collision rate by 53.50% compared to the SOTA end-to-end driving method, and ii) maintains superior driving performance under dynamically constrained communication conditions.
Submitted 9 April, 2025; v1 submitted 15 April, 2024;
originally announced April 2024.
-
Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
Authors:
Zhihao Lin,
Wei Ma,
Tao Lin,
Yaowen Zheng,
Jingquan Ge,
Jun Wang,
Jacques Klein,
Tegawende Bissyande,
Yang Liu,
Li Li
Abstract:
Large Language Models (LLMs) have become instrumental in advancing software engineering (SE) tasks, showcasing their efficacy in code understanding and beyond. Like traditional SE tools, open-source collaboration is key in realising the excellent products. However, with AI models, the essential need is in data. The collaboration of these AI-based SE models hinges on maximising the sources of high-quality data. However, data especially of high quality, often holds commercial or sensitive value, making it less accessible for open-source AI-based SE projects. This reality presents a significant barrier to the development and enhancement of AI-based SE tools within the software engineering community. Therefore, researchers need to find solutions for enabling open-source AI-based SE models to tap into resources by different organisations. Addressing this challenge, our position paper investigates one solution to facilitate access to diverse organizational resources for open-source AI models, ensuring privacy and commercial sensitivities are respected. We introduce a governance framework centered on federated learning (FL), designed to foster the joint development and maintenance of open-source AI code models while safeguarding data privacy and security. Additionally, we present guidelines for developers on AI-based SE tool collaboration, covering data requirements, model architecture, updating strategies, and version control. Given the significant influence of data characteristics on FL, our research examines the effect of code data heterogeneity on FL performance.
Submitted 9 April, 2024;
originally announced April 2024.
-
On Variation of Light Curves and Broad Emission Lines for Periodic QSOs from co-rotating Supermassive binary black holes in elliptical orbits
Authors:
Junqiang Ge,
Youjun Lu,
Changshuo Yan,
Jifeng Liu
Abstract:
Context. Periodic QSOs are considered as candidates of supermassive binary black hole (BBH) systems in galactic centers. Further confirmation of these candidates may require different lines of observational evidence. Aims. Assuming the Doppler boosting scenario, in this paper we investigate the (coherent) variations of both broad emission lines (BELs) and continuum light curves for active BBH systems surrounded by a circumbinary broad line region (cBLR) and focus on their dependence on the eccentric orbital configuration. Methods. We calculate the variation of continuum light and the Doppler enhanced/weakened photoionization of each BLR cloud according to the motion of BBHs in elliptical orbits, and finally obtain the coherent variation of the continuum and BELs. Results. We find that both the amplitude and variation pattern of the continuum light curves and the evolution of the BEL profiles sensitively depend on the eccentric orbital configuration of BBH systems. If only the secondary BH is active, both the variation amplitudes of continuum light curves and BELs increase with increasing BBH inclination angles and orbital eccentricities, but decrease with increasing BBH mass ratio. If both BHs are active, the asymmetry in the ionization of BLR clouds at different areas caused by the Doppler boosting effect of the secondary BH is weakened by that of the primary BH in the opposite direction, which leads to systematically smaller variation amplitudes of both continuum light curves and BELs than in the cases with only the secondary BH active. Conclusions. The coherent variations of the BEL profiles with the continuum light for those periodic QSOs provide an important way to confirm the existence of BBHs in their centers. Future joint analysis of the light curves and multi-epoch observed BEL profiles for periodic QSOs may lead to the identification of a number of BBH systems.
Submitted 8 April, 2024;
originally announced April 2024.
-
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
Authors:
Jiannan Ge,
Lingxi Xie,
Hongtao Xie,
Pandeng Li,
Xiaopeng Zhang,
Yongdong Zhang,
Qi Tian
Abstract:
A serious issue that harms the performance of zero-shot visual recognition is named objective misalignment, i.e., the learning objective prioritizes improving the recognition accuracy of seen classes rather than unseen classes, while the latter is the true target to pursue. This issue becomes more significant in zero-shot image segmentation because the stronger (i.e., pixel-level) supervision brings a larger gap between seen and unseen classes. To mitigate it, we propose a novel architecture named AlignZeg, which embodies a comprehensive improvement of the segmentation pipeline, including proposal extraction, classification, and correction, to better fit the goal of zero-shot segmentation. (1) Mutually-Refined Proposal Extraction. AlignZeg harnesses a mutual interaction between mask queries and visual features, facilitating detailed class-agnostic mask proposal extraction. (2) Generalization-Enhanced Proposal Classification. AlignZeg introduces synthetic data and incorporates multiple background prototypes to allocate a more generalizable feature space. (3) Predictive Bias Correction. During the inference stage, AlignZeg uses a class indicator to find potential unseen class proposals followed by a prediction postprocess to correct the prediction bias. Experiments demonstrate that AlignZeg markedly enhances zero-shot semantic segmentation, as shown by an average 3.8% increase in hIoU, primarily attributed to a 7.1% improvement in identifying unseen classes, and we further validate that the improvement comes from alleviating the objective misalignment issue.
Submitted 8 April, 2024;
originally announced April 2024.
-
Are High $Σ_1$ Massive Blue Spiral Galaxies Rejuvenated Systems?
Authors:
Cai-Na Hao,
Xiaoyang Xia,
Yong Shi,
Rui Guo,
Yanmei Chen,
Shuai Feng,
Junqiang Ge,
Qiusheng Gu
Abstract:
Quiescent galaxies generally possess denser cores than star-forming galaxies with similar mass. As a measurement of the core density, the central stellar mass surface density within a radius of 1 kpc ($Σ_1$) was thus suggested to be closely related to galaxy quenching. Massive star-forming galaxies with high $Σ_1$ do not fit into this picture. To understand the origin of such galaxies, we compare the spatially-resolved stellar population and star formation properties of massive ($ > 10^{10.5}{\rm M}_{\odot}$) blue spiral galaxies with high and low $Σ_1$, divided by $Σ_1 = 10^{9.4} M_\odot \, {\rm kpc}^{-2}$, based on the final release of MaNGA IFU data. We find that both high $Σ_1$ and low $Σ_1$ blue spirals show large diversities in stellar population and star formation properties. Despite the diversities, high $Σ_1$ blue spirals are statistically different from the low $Σ_1$ ones. Specifically, the radial profiles of the luminosity-weighted age and Mgb/${\rm \langle Fe \rangle}$ show that high $Σ_1$ blue spirals consist of a larger fraction of galaxies with younger and less $α$-element enhanced centers than their low $Σ_1$ counterparts, $\sim 55\%$ versus $\sim 30\%$. The galaxies with younger centers mostly have higher central specific star formation rates, which still follow the spaxel-based star formation main sequence relation though. Examinations of the H$α$ velocity field and the optical structures suggest that galactic bars or galaxy interactions should be responsible for the rejuvenation of these galaxies. The remaining $\sim 45\% $ of high $Σ_1$ blue spirals are consistent with the inside-out growth scenario.
Submitted 4 April, 2024;
originally announced April 2024.
-
FT2Ra: A Fine-Tuning-Inspired Approach to Retrieval-Augmented Code Completion
Authors:
Qi Guo,
Xiaohong Li,
Xiaofei Xie,
Shangqing Liu,
Ze Tang,
Ruitao Feng,
Junjie Wang,
Jidong Ge,
Lei Bu
Abstract:
The rise of code pre-trained models has significantly enhanced various coding tasks, such as code completion, and tools like GitHub Copilot. However, the substantial size of these models, especially large models, poses a significant challenge when it comes to fine-tuning them for specific downstream tasks. As an alternative approach, retrieval-based methods have emerged as a promising solution, augmenting model predictions without the need for fine-tuning. Despite their potential, a significant challenge is that the designs of these methods often rely on heuristics, leaving critical questions about what information should be stored or retrieved and how to interpolate such information for augmenting predictions.
To tackle this challenge, we first perform a theoretical analysis of the fine-tuning process, highlighting the importance of delta logits as a catalyst for improving model predictions. Building on this insight, we develop a novel retrieval-based method, FT2Ra, which aims to mimic genuine fine-tuning. While FT2Ra adopts a retrieval-based mechanism, it uniquely adopts a paradigm with a learning rate and multi-epoch retrievals, which is similar to fine-tuning. In token-level completion, which represents a relatively easier task, FT2Ra achieves a 4.29% improvement in accuracy compared to the best baseline method on UniXcoder. In the more challenging line-level completion task, we observe a more than twofold increase in Exact Match (EM) performance, indicating the significant advantages of our theoretical analysis. Notably, even when operating without actual fine-tuning, FT2Ra exhibits competitive performance compared to the models with real fine-tuning.
Submitted 1 April, 2024;
originally announced April 2024.
-
InternLM2 Technical Report
Authors:
Zheng Cai,
Maosong Cao,
Haojiong Chen,
Kai Chen,
Keyu Chen,
Xin Chen,
Xun Chen,
Zehui Chen,
Zhi Chen,
Pei Chu,
Xiaoyi Dong,
Haodong Duan,
Qi Fan,
Zhaoye Fei,
Yang Gao,
Jiaye Ge,
Chenya Gu,
Yuzhe Gu,
Tao Gui,
Aijia Guo,
Qipeng Guo,
Conghui He,
Yingfan Hu,
Ting Huang,
Tao Jiang
, et al. (75 additional authors not shown)
Abstract:
The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI). However, replicating such advancements in open-source models has been challenging. This paper introduces InternLM2, an open-source LLM that outperforms its predecessors in comprehensive evaluations across 6 dimensions and 30 benchmarks, long-context modeling, and open-ended subjective evaluations through innovative pre-training and optimization techniques. The pre-training process of InternLM2 is meticulously detailed, highlighting the preparation of diverse data types including text, code, and long-context data. InternLM2 efficiently captures long-term dependencies, initially trained on 4k tokens before advancing to 32k tokens in pre-training and fine-tuning stages, exhibiting remarkable performance on the 200k ``Needle-in-a-Haystack" test. InternLM2 is further aligned using Supervised Fine-Tuning (SFT) and a novel Conditional Online Reinforcement Learning from Human Feedback (COOL RLHF) strategy that addresses conflicting human preferences and reward hacking. By releasing InternLM2 models in different training stages and model sizes, we provide the community with insights into the model's evolution.
Submitted 25 March, 2024;
originally announced March 2024.
-
RIS-assisted Cell-Free Massive MIMO Systems With Two-Timescale Design and Hardware Impairments
Authors:
Jianxin Dai,
Jin Ge,
Kangda Zhi,
Cunhua Pan,
Youguo Wang
Abstract:
Integrating the reconfigurable intelligent surface (RIS) into a cell-free massive multiple-input multiple-output (CF-mMIMO) system is an effective solution to achieve high system capacity with low cost and power consumption. However, existing works of RIS-assisted systems mostly assumed perfect hardware, while the impact of hardware impairments (HWIs) is generally ignored. In this paper, we consider the general Rician fading channel and uplink transmission of the RIS-assisted CF-mMIMO system under transceiver impairments and RIS phase noise. To reduce the feedback overhead and power consumption, we propose a two-timescale transmission scheme to optimize the passive beamformers at RISs with statistical channel state information (CSI), while transmit beamformers at access points (APs) are designed based on instantaneous CSI. Also, the maximum ratio combining (MRC) detection is applied to the central processing unit (CPU). On this basis, we derive the closed-form approximate expression of the achievable rate, based on which the impact of HWIs and the power scaling laws are analyzed to draw useful theoretical insights. To maximize the users' sum rate or minimum rate, we first transform our rate expression into a tractable form, and then optimize the phase shifts of RISs based on an accelerated gradient ascent method. Finally, numerical results are presented to demonstrate the correctness of our derived expressions and validate the previous analysis, which provide some guidelines for the practical application of the imperfect RISs in the CF-mMIMO with transceiver HWIs.
Submitted 26 March, 2024; v1 submitted 22 March, 2024;
originally announced March 2024.
-
Suppression of flux jumps in high-$J_c$ Nb$_3$Sn conductors by ferromagnetic layer
Authors:
Cun Xue,
Kai-Wei Cao,
Tian He,
Chong Wei,
Wei Liu,
Jun-Yi Ge
Abstract:
Flux jumps observed in high-$J_c$ Nb$_3$Sn conductors are urgent problems to construct high field superconducting magnets. The low-field instabilities usually reduce the current-carrying capability and thus cause the premature quench of Nb$_3$Sn coils at low magnetic field. In this paper, we explore suppressing the flux jumps by ferromagnetic (FM) layer. Firstly, we experimentally and theoretically investigate the flux jumps of Nb$_3$Sn/FM hybrid wires exposed to a magnetic field loop with constant sweeping rate. Comparing with bare Nb$_3$Sn and Nb$_3$Sn/Cu wires, we reveal two underlying mechanisms that the suppression of flux jumps is mainly attributed to the thermal effect of FM layer for the case of lower sweeping rate, whereas both thermal and electromagnetic effects play a crucial role for the case of higher sweeping rate. Furthermore, we explore the flux jumps of Nb$_3$Sn/FM hybrid wires exposed to AC magnetic fields with amplitude $B_{a0}$ and frequency $\rmω$. We build up the phase diagrams of flux jumps in the plane $\rmω$-$B_{a0}$ for bare Nb$_{3}$Sn wire, Nb$_{3}$Sn/Cu wire and Nb$_{3}$Sn/FM wire, respectively. We stress that the region of flux jumps of Nb$_{3}$Sn/FM wire is much smaller than the other two wires, which indicates that the Nb$_{3}$Sn/FM wire has significant advantage over merely increasing the heat capacity. The findings shed light on suppression of the flux jumps by utilizing FM materials, which is useful for developing new type of high-$J_c$ Nb$_{3}$Sn conductors.
Submitted 4 June, 2024; v1 submitted 10 March, 2024;
originally announced March 2024.
-
Nanoscale variation of the Rashba energy in BiTeI
Authors:
Ruizhe Kang,
Jian-Feng Ge,
Yang He,
Zhihuai Zhu,
Daniel T. Larson,
Mohammed Saghir,
Jason D. Hoffman,
Geetha Balakrishnan,
Jennifer E. Hoffman
Abstract:
BiTeI is a polar semiconductor with strong spin-orbit coupling (SOC) that produces large Rashba spin splitting. Due to its potential utility in spintronics and magnetoelectrics, it is essential to understand how defects impact the spin transport in this material. Using scanning tunneling microscopy and spectroscopy, we image ring-like charging states of single-atom defects on the iodine surface of BiTeI. We observe nanoscale variations in the Rashba energy around each defect, which we correlate with the local electric field extracted from the bias dependence of each ring radius. Our data demonstrate the local impact of atomic defects on the Rashba effect, which is both a challenge and an opportunity for the development of future nanoscale spintronic devices.
Submitted 28 March, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
-
Feature Selection Based on Orthogonal Constraints and Polygon Area
Authors:
Zhenxing Zhang,
Jun Ge,
Zheng Wei,
Chunjie Zhou,
Yilei Wang
Abstract:
The goal of feature selection is to choose the optimal subset of features for a recognition task by evaluating the importance of each feature, thereby achieving effective dimensionality reduction. Currently, proposed feature selection methods often overlook the discriminative dependencies between features and labels. To address this problem, this paper introduces a novel orthogonal regression model incorporating the area of a polygon. The model can intuitively capture the discriminative dependencies between features and labels. Additionally, this paper employs a hybrid non-monotone linear search method to efficiently tackle the non-convex optimization challenge posed by orthogonal constraints. Experimental results demonstrate that our approach not only effectively captures discriminative dependency information but also surpasses traditional methods in reducing feature dimensions and enhancing classification performance.
Submitted 25 February, 2024;
originally announced February 2024.
-
VistaScenario: Interaction Scenario Engineering for Vehicles with Intelligent Systems for Transport Automation
Authors:
Cheng Chang,
Jiawei Zhang,
Jingwei Ge,
Zuo Zhang,
Junqing Wei,
Li Li,
Fei-Yue Wang
Abstract:
Intelligent vehicles and autonomous driving systems rely on scenario engineering for intelligence and index (I&I), calibration and certification (C&C), and verification and validation (V&V). To extract and index scenarios, various vehicle interactions are worthy of much attention, and deserve refined descriptions and labels. However, existing methods cannot cope well with the problem of scenario classification and labeling with vehicle interactions as the core. In this paper, we propose VistaScenario framework to conduct interaction scenario engineering for vehicles with intelligent systems for transport automation. Based on the summarized basic types of vehicle interactions, we slice scenario data stream into a series of segments via spatiotemporal scenario evolution tree. We also propose the scenario metric Graph-DTW based on Graph Computation Tree and Dynamic Time Warping to conduct refined scenario comparison and labeling. The extreme interaction scenarios and corner cases can be efficiently filtered and extracted. Moreover, with naturalistic scenario datasets, testing examples on trajectory prediction model demonstrate the effectiveness and advantages of our framework. VistaScenario can provide solid support for the usage and indexing of scenario data, further promote the development of intelligent vehicles and transport automation.
Submitted 13 May, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
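The VistaScenario abstract above builds its Graph-DTW metric on Dynamic Time Warping. As a rough illustration of the underlying idea only, the sketch below implements classic DTW on two one-dimensional sequences; it is not the authors' Graph-DTW, and the function name `dtw_distance` and the absolute-difference local cost are assumptions made for this example.

```python
import math


def dtw_distance(a, b):
    """Classic dynamic time warping distance between two numeric sequences.

    Hypothetical helper for illustration; uses |x - y| as the local cost.
    """
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# Two speed profiles with the same shape but different timing align at zero cost.
same_shape = dtw_distance([1, 2, 3], [1, 2, 2, 3])
offset = dtw_distance([0, 0], [1, 1])
```

Because DTW allows one sequence to "wait" while the other advances, trajectories that differ only in pacing score as similar, which is the property that makes it a plausible building block for comparing scenario segments of unequal length.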
-
DeMarking: A Defense for Network Flow Watermarking in Real-Time
Authors:
Yali Yuan,
Jian Ge,
Guang Cheng
Abstract:
The network flow watermarking technique associates the two communicating parties by actively modifying certain characteristics of the stream generated by the sender so that it covertly carries some special marking information. Some curious users communicating with the hidden server as a Tor client may attempt de-anonymization attacks to uncover the real identity of the hidden server by using this technique. This compromises the privacy of the anonymized communication system. Therefore, we propose a defense scheme against flow watermarking. The scheme is based on deep neural networks and utilizes generative adversarial networks to convert the original Inter-Packet Delays (IPD) into new IPDs generated by the model. We also adopt the concept of adversarial attacks to ensure that the detector will produce an incorrect classification when detecting these new IPDs. This approach ensures that these IPDs are considered "clean", effectively covering the potential watermarks. This scheme is effective against time-based flow watermarking techniques.
Submitted 6 February, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
A survey on the DDVV-type inequalities
Authors:
Jianquan Ge,
Fagui Li,
Zizhou Tang,
Yi Zhou
Abstract:
In this paper, we give a survey on the history and recent developments on the DDVV-type inequalities.
Submitted 1 February, 2024;
originally announced February 2024.
-
Temperature Compensation Method of Fluxgate Sensor Based on Polynomial Fitting
Authors:
Ruiping Yang,
Huan Liu,
Jian Ge,
Daisuke Chugo,
Haobin Dong
Abstract:
Fluxgate sensors are widely used in the field of low frequency and weak vector magnetic field measurement because of their good performance, such as high resolution and low power consumption. However, during long-term continuous observation, drift errors of the fluxgate sensor occur due to the variable ambient temperature. This paper proposes a temperature compensation method for fluxgate sensors based on polynomial fitting. First, a physical model of the temperature and fluxgate sensor was established on the COMSOL Multiphysics simulation platform, and the influence of temperature on the measurement performance of the fluxgate sensor was analyzed. Second, according to the existing temperature-magnetic field data, a temperature compensation model of the fluxgate sensor was constructed. Compared with other temperature compensation methods, the proposed method is relatively simple and can better achieve real-time compensation in sensor application scenarios. Finally, to verify the effectiveness of the proposed method, numerous laboratory experiments were conducted. The temperature drift is reduced from more than 500 nT before compensation to about 1 nT. The results show that the proposed method has a good temperature compensation effect on the data measured by the fluxgate sensor under a variable temperature background.
Submitted 14 March, 2024; v1 submitted 24 January, 2024;
originally announced January 2024.
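The fluxgate abstract above describes compensating temperature drift with a polynomial fitted to temperature-magnetic field data. A minimal sketch of that general idea follows, assuming a calibration run in a constant field; the synthetic data, the cubic degree, the 25 °C reference temperature, and the helper names `fit_drift_model` and `compensate` are all assumptions for illustration and are not taken from the paper.

```python
import numpy as np


def fit_drift_model(temps_c, readings_nt, degree=3):
    """Fit reading-vs-temperature with a polynomial (hypothetical helper)."""
    return np.polynomial.Polynomial.fit(temps_c, readings_nt, deg=degree)


def compensate(readings_nt, temps_c, model, ref_temp_c=25.0):
    """Remove the modeled drift relative to an assumed reference temperature."""
    return readings_nt - (model(temps_c) - model(ref_temp_c))


# Synthetic calibration data (assumed): a constant 100 nT field plus a
# temperature-dependent drift and a little measurement noise.
rng = np.random.default_rng(0)
temps = np.linspace(-10.0, 50.0, 200)
drift = 8.0 * (temps - 25.0) + 0.05 * (temps - 25.0) ** 2
raw = 100.0 + drift + rng.normal(0.0, 0.2, temps.size)

model = fit_drift_model(temps, raw)
corrected = compensate(raw, temps, model)

raw_span = float(raw.max() - raw.min())
corrected_span = float(corrected.max() - corrected.min())
```

In this toy setup the spread of the readings shrinks from hundreds of nT to roughly the noise level. Once the coefficients are fitted, applying the correction costs only one polynomial evaluation per sample, which is in keeping with the abstract's emphasis on simplicity and real-time use.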
-
Prominence activation, optical flare, and post-flare loops on the RS Canum Venaticorum star SZ Piscium
Authors:
Dongtao Cao,
Shenghong Gu,
Jian Ge,
Tinggui Wang,
Jilin Zhou,
Liang Chang,
U. Wolter,
M. Mittag,
J. H. M. M. Schmitt,
V. Perdelwitz
Abstract:
We present the results of time-resolved high-resolution spectroscopic observations of the very active RS Canum Venaticorum (RS CVn) star SZ Piscium (SZ Psc), obtained during two consecutive observing nights on October 24 and 25, 2011. Several optical chromospheric activity indicators are analyzed using the spectral subtraction technique, and they show remarkably different behavior between the two nights. Gradually blue-shifted and strengthened excess absorption features in the series of subtracted spectra (especially for the H$_α$, He I D$_{3}$ and H$_β$ lines), resulting from an active stellar prominence rising in height along our line of sight, were detected in the observations on October 24. This prominence activation event was probably associated with the subsequent optical flare, part of whose decay phase was captured in the observations on October 25. The flare was characterized by prominent He I D$_{3}$ line emission, as well as stronger chromospheric emission in the H$_α$, H$_β$ and other active lines. The gradual decay of the flare was accompanied by an evidently developing absorption feature in the blue wing of the H$_α$ and other active lines, which could be explained as cool post-flare loops projected against the bright flare background. Therefore, a series of possibly associated magnetic activity phenomena, including flare-related prominence activation, optical flare and post-flare loops, were detected during our observations.
Submitted 4 January, 2024;
originally announced January 2024.
-
Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification
Authors:
Xuelin Zhu,
Jian Liu,
Dongqi Tang,
Jiawei Ge,
Weijia Liu,
Bo Liu,
Jiuxin Cao
Abstract:
Identifying labels that did not appear during training, known as multi-label zero-shot learning, is a non-trivial task in computer vision. To this end, recent studies have attempted to explore the multi-modal knowledge of vision-language pre-training (VLP) models by knowledge distillation, allowing to recognize unseen labels in an open-vocabulary manner. However, experimental evidence shows that knowledge distillation is suboptimal and provides limited performance gain in unseen label prediction. In this paper, a novel query-based knowledge sharing paradigm is proposed to explore the multi-modal knowledge from the pretrained VLP model for open-vocabulary multi-label classification. Specifically, a set of learnable label-agnostic query tokens is trained to extract critical vision knowledge from the input image, and further shared across all labels, allowing them to select tokens of interest as visual clues for recognition. Besides, we propose an effective prompt pool for robust label embedding, and reformulate the standard ranking learning into a form of classification to allow the magnitude of feature vectors for matching, which both significantly benefit label recognition. Experimental results show that our framework significantly outperforms state-of-the-art methods on zero-shot task by 5.9% and 4.5% in mAP on the NUS-WIDE and Open Images, respectively.
Submitted 2 January, 2024;
originally announced January 2024.
-
Discovery of Small Ultra-short-period Planets Orbiting KG Dwarfs in Kepler Survey Using GPU Phase Folding and Deep Learning Detection System
Authors:
Kaitlyn Wang,
Jian Ge,
Kevin Willis,
Kevin Wang,
Yinan Zhao,
Quanquan Hu
Abstract:
Of over 5,000 exoplanets identified so far, only a few hundred possess sub-Earth radii. The formation processes of these sub-Earths remain elusive, and acquiring additional samples is essential for investigating this unique population. In our study, we employ the GPFC method, a novel GPU Phase Folding algorithm combined with a Convolutional Neural Network, on Kepler photometry data. This method enhances the transit search speed significantly over the traditional Box-fitting Least Squares method, allowing a complete search of the known Kepler KOI data within days using a commercial GPU card. To date, we have identified five new ultra-short-period planets (USPs): Kepler-158d, Kepler-963c, Kepler-879c, Kepler-1489c, and KOI-4978.02. Kepler-879c with a radius of $0.4 R_\oplus$ completes its orbit around a G dwarf in 0.646716 days. Kepler-158d with a radius of $0.43 R_\oplus$ orbits a K dwarf star every 0.645088 days. Kepler-1489c with a radius of $0.51 R_\oplus$ orbits a G dwarf in 0.680741 days. Kepler-963c with a radius of $0.6 R_\oplus$ revolves around a G dwarf in 0.919783 days, and KOI-4978.02 with a radius of $0.7 R_\oplus$ circles a G dwarf in 0.941967 days. Among our findings, Kepler-879c, Kepler-158d, and Kepler-963c rank as the first, third, and fourth smallest USPs identified to date, respectively. Notably, Kepler-158d stands as the smallest USP found orbiting K dwarfs, while Kepler-963c, Kepler-879c, Kepler-1489c, and KOI-4978.02 are the smallest USPs found orbiting G dwarfs. Kepler-879c, Kepler-158d, Kepler-1489c, and KOI-4978.02 are among the smallest planets that are closest to their host stars, with orbits within 5 stellar radii. In addition, these discoveries highlight GPFC's promising capability in identifying small, new transiting exoplanets within photometry data from Kepler, TESS, and the upcoming space transit missions PLATO and ET.
Submitted 14 September, 2024; v1 submitted 28 December, 2023;
originally announced December 2023.
-
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Authors:
Xinyan Chen,
Jiaxin Ge,
Tianjun Zhang,
Jiaming Liu,
Shanghang Zhang
Abstract:
Diffusion models have shown impressive performance in many domains. However, the model's capability to follow natural language instructions (e.g., spatial relationships between objects, generating complex scenes) is still unsatisfactory. In this work, we propose Iterative Prompt Relabeling (IPR), a novel algorithm that aligns images to text through iterative image sampling and prompt relabeling with feedback. IPR first samples a batch of images conditioned on the text, then relabels the text prompts of unmatched text-image pairs with classifier feedback. We conduct thorough experiments on SDv2 and SDXL, testing their capability to follow instructions on spatial relations. With IPR, we achieve an improvement of up to 15.22% (absolute) on the challenging spatial relation VISOR benchmark, demonstrating superior performance compared to previous RL methods. Our code is publicly available at https://github.com/xinyan-cxy/IPR-RLDF.
Submitted 19 March, 2025; v1 submitted 23 December, 2023;
originally announced December 2023.