-
Multistate Density Functional Theory for Local and Charge-Transfer Tripdoublet States from Triplet-Free Radical Interactions
Authors:
Chenyu Liu,
Yang Xu,
Peng Bao,
Yangyi Lu,
Jiali Gao
Abstract:
The interaction between excited states of a closed-shell chromophore and a nearby free radical species gives rise to spin-coupled doublet states, namely singdoublet and tripdoublet, as well as a quartet state. This coupling facilitates transitions that are otherwise spin-forbidden, thereby enhancing intersystem crossing and influencing luminescence and non-radiative decay pathways. In this chapter…
▽ More
The interaction between excited states of a closed-shell chromophore and a nearby free radical species gives rise to spin-coupled doublet states, namely singdoublet and tripdoublet, as well as a quartet state. This coupling facilitates transitions that are otherwise spin-forbidden, thereby enhancing intersystem crossing and influencing luminescence and non-radiative decay pathways. In this chapter, we explore these interactions using multistate density functional theory (MSDFT). By employing a minimal active space (MAS) comprising just ten determinant configurations, MSDFT effectively captures local and charge-transfer excitations with inclusion of correlation effects. MSDFT extends the Hohenberg-Kohn density functional theory from the ground state to encompass all electronic states, underscoring the potential for developing computationally efficient methods to study excited states. Numerical results demonstrate that MSDFT accurately reproduces both qualitative trends and quantitative excited-state energies, in accord with previous studies using extended multistate complete-active-space second-order perturbation theory (XMS-CASPT2). The work explores energy changes along a reaction path from the D_0/D_1 minimum energy crossing intersection to the D_2/D_3 crossing in the exciplex formed by 10-methylphenothiazine and a dicarboximide electron acceptor linked to the stable free radical 2,2,6,6-tetramethylpiperidin-1-oxyl (TEMPO).
△ Less
Submitted 21 March, 2025;
originally announced March 2025.
-
Tri-layer SiN-on-Si 8x8 Optical Switches with Thermo-optic and Electro-optic Actuators
Authors:
Bohao Sun,
Chunhui Yao,
Tongyun Li,
Ziyao Zhang,
Peng Bao,
Minjia Chen,
Alan Yilun Yuan,
Chenxi Tan,
Zhitian Shi,
Adrian Wonfor,
Seb Savory,
Keren Bergman,
Richard Penty,
Qixiang Cheng
Abstract:
We present two spatial-multiplexed switch-and-select (S&S) 8x8 optical switches incorporating a tri-layer SiN-on-Si platform, one equipped with thermo-optic (T-O) and the other electro-optic (E-O) switching elements. To the best of our knowledge, the electro-optic switch fabric is the first-of-its-kind device assembled in such a multi-layer platform. The shuffle between the multiplexer and demulti…
▽ More
We present two spatial-multiplexed switch-and-select (S&S) 8x8 optical switches incorporating a tri-layer SiN-on-Si platform, one equipped with thermo-optic (T-O) and the other electro-optic (E-O) switching elements. To the best of our knowledge, the electro-optic switch fabric is the first-of-its-kind device assembled in such a multi-layer platform. The shuffle between the multiplexer and demultiplexer array is established via a tri-layer Si-SiN-SiN structure, creating a three-dimensional crossing-free photonic shuffle network. At the same time, the implementation of the S&S topology can effectively suppress the first-order crosstalk. The measured on-chip losses for the T-O switch range from 2.1 to 11.5 dB, with a 5.2 dB average, while the E-O device exhibits losses between 8.7 to 19.6 dB, with a 15.1 dB average. Both switches demonstrate ultra-low crosstalk, with measured ranges of 38.9 to 50.8 dB and 42.8 to 51.9 dB, for the T-O and E-O devices respectively. The switching times are 17.6 us for the T-O switch and 5.9 ns with the E-O actuated one. These performance metrics highlight the potential of these switches for next-generation data center applications.
△ Less
Submitted 22 February, 2025; v1 submitted 16 February, 2025;
originally announced February 2025.
-
Optical Convolutional Spectrometer
Authors:
Chunhui Yao,
Jie Ma,
Ningning Wang,
Peng Bao,
Wei Zhuo,
Tao Zhang,
Wanlu Zhang,
Kangning Xu,
Ting Yan,
Liang Ming,
Yuxiao Ye,
Tawfique Hasan,
Ian White,
Richard Penty,
Qixiang Cheng
Abstract:
Optical spectrometers are fundamental across numerous disciplines in science and technology. However, miniaturized versions, while essential for in situ measurements, are often restricted to coarse identification of signature peaks and inadequate for metrological purposes. Here, we introduce a new class of spectrometer, leveraging the convolution theorem as its mathematical foundation. Our convolu…
▽ More
Optical spectrometers are fundamental across numerous disciplines in science and technology. However, miniaturized versions, while essential for in situ measurements, are often restricted to coarse identification of signature peaks and inadequate for metrological purposes. Here, we introduce a new class of spectrometer, leveraging the convolution theorem as its mathematical foundation. Our convolutional spectrometer offers unmatched performance for miniaturized systems and distinct structural and computational simplicity, featuring a centimeter-scale footprint for the fully packaged unit, low cost (~$10) and a 2400 cm-1 (approximately 500 nm) bandwidth. We achieve excellent precision in resolving complex spectra with sub-second sampling and processing time, demonstrating a wide range of applications from industrial and agricultural analysis to healthcare monitoring. Specifically, our spectrometer system classifies diverse solid samples, including plastics, pharmaceuticals, coffee, flour and tea, with 100% success rate, and quantifies concentrations of aqueous and organic solutions with detection accuracy surpassing commercial benchtop spectrometers. We also realize the non-invasive sensing of human biomarkers, such as skin moisture (mean absolute error; MAE = 2.49%), blood alcohol (1.70 mg/dL), blood lactate (0.81 mmol/L), and blood glucose (0.36 mmol/L), highlighting the potential of this new class of spectrometers for low-cost, high-precision, portable/wearable spectral metrology.
△ Less
Submitted 12 February, 2025;
originally announced February 2025.
-
ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models
Authors:
Jingwei Yi,
Junhao Yin,
Ju Xu,
Peng Bao,
Yongliang Wang,
Wei Fan,
Hao Wang
Abstract:
Vision-Language Models (VLMs) have demonstrated remarkable capabilities in understanding multimodal inputs and have been widely integrated into Retrieval-Augmented Generation (RAG) based conversational systems. While current VLM-powered chatbots can provide textual source references in their responses, they exhibit significant limitations in referencing contextually relevant images during conversa…
▽ More
Vision-Language Models (VLMs) have demonstrated remarkable capabilities in understanding multimodal inputs and have been widely integrated into Retrieval-Augmented Generation (RAG) based conversational systems. While current VLM-powered chatbots can provide textual source references in their responses, they exhibit significant limitations in referencing contextually relevant images during conversations. In this paper, we introduce Contextual Image Reference -- the ability to appropriately reference relevant images from retrieval documents based on conversation context -- and systematically investigate VLMs' capability in this aspect. We conduct the first evaluation for contextual image referencing, comprising a dedicated testing dataset and evaluation metrics. Furthermore, we propose ImageRef-VL, a method that significantly enhances open-source VLMs' image referencing capabilities through instruction fine-tuning on a large-scale, manually curated multimodal conversation dataset. Experimental results demonstrate that ImageRef-VL not only outperforms proprietary models but also achieves an 88% performance improvement over state-of-the-art open-source VLMs in contextual image referencing tasks. Our code is available at https://github.com/bytedance/ImageRef-VL.
△ Less
Submitted 20 January, 2025;
originally announced January 2025.
-
Vid-Morp: Video Moment Retrieval Pretraining from Unlabeled Videos in the Wild
Authors:
Peijun Bao,
Chenqi Kong,
Zihao Shao,
Boon Poh Ng,
Meng Hwa Er,
Alex C. Kot
Abstract:
Given a natural language query, video moment retrieval aims to localize the described temporal moment in an untrimmed video. A major challenge of this task is its heavy dependence on labor-intensive annotations for training. Unlike existing works that directly train models on manually curated data, we propose a novel paradigm to reduce annotation costs: pretraining the model on unlabeled, real-wor…
▽ More
Given a natural language query, video moment retrieval aims to localize the described temporal moment in an untrimmed video. A major challenge of this task is its heavy dependence on labor-intensive annotations for training. Unlike existing works that directly train models on manually curated data, we propose a novel paradigm to reduce annotation costs: pretraining the model on unlabeled, real-world videos. To support this, we introduce Video Moment Retrieval Pretraining (Vid-Morp), a large-scale dataset collected with minimal human intervention, consisting of over 50K videos captured in the wild and 200K pseudo annotations. Direct pretraining on these imperfect pseudo annotations, however, presents significant challenges, including mismatched sentence-video pairs and imprecise temporal boundaries. To address these issues, we propose the ReCorrect algorithm, which comprises two main phases: semantics-guided refinement and memory-consensus correction. The semantics-guided refinement enhances the pseudo labels by leveraging semantic similarity with video frames to clean out unpaired data and make initial adjustments to temporal boundaries. In the following memory-consensus correction phase, a memory bank tracks the model predictions, progressively correcting the temporal boundaries based on consensus within the memory. Comprehensive experiments demonstrate ReCorrect's strong generalization abilities across multiple downstream settings. Zero-shot ReCorrect achieves over 75% and 80% of the best fully-supervised performance on two benchmarks, while unsupervised ReCorrect reaches about 85% on both. The code, dataset, and pretrained models are available at https://github.com/baopj/Vid-Morp.
△ Less
Submitted 1 December, 2024;
originally announced December 2024.
-
SimBase: A Simple Baseline for Temporal Video Grounding
Authors:
Peijun Bao,
Alex C. Kot
Abstract:
This paper presents SimBase, a simple yet effective baseline for temporal video grounding. While recent advances in temporal grounding have led to impressive performance, they have also driven network architectures toward greater complexity, with a range of methods to (1) capture temporal relationships and (2) achieve effective multimodal fusion. In contrast, this paper explores the question: How…
▽ More
This paper presents SimBase, a simple yet effective baseline for temporal video grounding. While recent advances in temporal grounding have led to impressive performance, they have also driven network architectures toward greater complexity, with a range of methods to (1) capture temporal relationships and (2) achieve effective multimodal fusion. In contrast, this paper explores the question: How effective can a simplified approach be? To investigate, we design SimBase, a network that leverages lightweight, one-dimensional temporal convolutional layers instead of complex temporal structures. For cross-modal interaction, SimBase only employs an element-wise product instead of intricate multimodal fusion. Remarkably, SimBase achieves state-of-the-art results on two large-scale datasets. As a simple yet powerful baseline, we hope SimBase will spark new ideas and streamline future evaluations in temporal video grounding.
△ Less
Submitted 12 November, 2024;
originally announced November 2024.
-
Ultra-low-crosstalk Silicon Switches Driven Thermally and Electrically
Authors:
Peng Bao,
Chunhui Yao,
Chenxi Tan,
Alan Yilun Yuan,
Minjia Chen,
Seb J. Savory,
Richard Penty,
Qixiang Cheng
Abstract:
Silicon photonic switches are widely considered as a cost-effective solution for addressing the ever-growing data traffic in datacenter networks, as they offer unique advantages such as low power consumption, low latency, small footprint and high bandwidth. Despite extensive research efforts, crosstalk in large-scale photonic circuits still poses a threat to the signal integrity. In this paper, we…
▽ More
Silicon photonic switches are widely considered as a cost-effective solution for addressing the ever-growing data traffic in datacenter networks, as they offer unique advantages such as low power consumption, low latency, small footprint and high bandwidth. Despite extensive research efforts, crosstalk in large-scale photonic circuits still poses a threat to the signal integrity. In this paper, we present two designs of silicon Mach-Zehnder Interferometer (MZI) switches achieving ultra-low-crosstalk, driven thermally and electrically. Each switch fabric is optimized at both the device and circuit level to suppress crosstalk and reduce system complexity. Notably, for the first time to the best of our knowledge, we harness the inherent self-heating effect in a carrier-injection-based MZI switch to create a pair of phase shifters that offer arbitrary phase differences. Such a pair of phase shifters induces matched insertion loss at each arm, thus minimizing crosstalk. Experimentally, an ultra-low crosstalk ratio below -40 dB is demonstrated for both thermo-optic (T-O) and electro-optic (E-O) switches. The T-O switch exhibits an on-chip loss of less than 5 dB with a switching time of 500 microseconds, whereas the E-O switch achieves an on-chip loss as low as 8.5 dB with a switching time of under 100 ns. In addition, data transmission of a 50 Gb/s on-off keying signal is demonstrated with high fidelity on the E-O switch, showing the great potential of the proposed switch designs.
△ Less
Submitted 1 October, 2024;
originally announced October 2024.
-
Open-Set Deepfake Detection: A Parameter-Efficient Adaptation Method with Forgery Style Mixture
Authors:
Chenqi Kong,
Anwei Luo,
Peijun Bao,
Haoliang Li,
Renjie Wan,
Zengwei Zheng,
Anderson Rocha,
Alex C. Kot
Abstract:
Open-set face forgery detection poses significant security threats and presents substantial challenges for existing detection models. These detectors primarily have two limitations: they cannot generalize across unknown forgery domains and inefficiently adapt to new data. To address these issues, we introduce an approach that is both general and parameter-efficient for face forgery detection. It b…
▽ More
Open-set face forgery detection poses significant security threats and presents substantial challenges for existing detection models. These detectors primarily have two limitations: they cannot generalize across unknown forgery domains and inefficiently adapt to new data. To address these issues, we introduce an approach that is both general and parameter-efficient for face forgery detection. It builds on the assumption that different forgery source domains exhibit distinct style statistics. Previous methods typically require fully fine-tuning pre-trained networks, consuming substantial time and computational resources. In turn, we design a forgery-style mixture formulation that augments the diversity of forgery source domains, enhancing the model's generalizability across unseen domains. Drawing on recent advancements in vision transformers (ViT) for face forgery detection, we develop a parameter-efficient ViT-based detection model that includes lightweight forgery feature extraction modules and enables the model to extract global and local forgery clues simultaneously. We only optimize the inserted lightweight modules during training, maintaining the original ViT structure with its pre-trained ImageNet weights. This training strategy effectively preserves the informative pre-trained knowledge while flexibly adapting the model to the task of Deepfake detection. Extensive experimental results demonstrate that the designed model achieves state-of-the-art generalizability with significantly reduced trainable parameters, representing an important step toward open-set Deepfake detection in the wild.
△ Less
Submitted 22 August, 2024;
originally announced August 2024.
-
Chip-scale sensor for spectroscopic metrology
Authors:
Chunhui Yao,
Wanlu Zhang,
Peng Bao,
Jie Ma,
Wei Zhuo,
Minjia Chen,
Zhitian Shi,
Jingwen Zhou,
Yuxiao Ye,
Liang Ming,
Ting Yan,
Richard Penty,
Qixiang Cheng
Abstract:
Miniaturized spectrometers hold great promise for in situ, in vitro, and even in vivo sensing applications. However, their size reduction imposes vital performance constraints in meeting the rigorous demands of spectroscopy, including fine resolution, high accuracy, and ultra-wide observation window. The prevailing view in the community holds that miniaturized spectrometers are most suitable for t…
▽ More
Miniaturized spectrometers hold great promise for in situ, in vitro, and even in vivo sensing applications. However, their size reduction imposes vital performance constraints in meeting the rigorous demands of spectroscopy, including fine resolution, high accuracy, and ultra-wide observation window. The prevailing view in the community holds that miniaturized spectrometers are most suitable for the coarse identification of signature peaks. In this paper, we present an integrated reconstructive spectrometer that enables near-infrared (NIR) spectroscopic metrology, and demonstrate a fully packaged sensor with auxiliary electronics. Such a sensor operates over a 520 nm bandwidth together with a resolution of less than 8 pm, which translates into a record-breaking bandwidth-to-resolution ratio of over 65,000. The classification of different types of solid substances and the concentration measurement of aqueous and organic solutions are performed, all achieving approximately 100% accuracy. Notably, the detection limit of our sensor matches that of the commercial benchtop counterparts, which is as low as 0.1% (i.e. 100 mg/dL) for identifying the concentration of glucose solution.
△ Less
Submitted 14 September, 2024; v1 submitted 25 July, 2024;
originally announced July 2024.
-
Single-antenna super-resolution positioning with nonseparable toroidal pulses
Authors:
Ren Wang,
Pan-Yi Bao,
Bing-Zhong Wang,
Yijie Shen
Abstract:
The fundamental principle of satellite or node-based positioning involves triangulating the receiver's coordinates through the intersection of spatial distances. Recent advancements in hybrid wireless networks have yielded high-precision positioning at decimetre-level (wavelength-level) (Nature 611, 473-478 (2022)), approaching the resolution limits in free space. Here, we present a three-dimensio…
▽ More
The fundamental principle of satellite or node-based positioning involves triangulating the receiver's coordinates through the intersection of spatial distances. Recent advancements in hybrid wireless networks have yielded high-precision positioning at decimetre-level (wavelength-level) (Nature 611, 473-478 (2022)), approaching the resolution limits in free space. Here, we present a three-dimensional (3D) super-resolution positioning paradigm in free space by utilizing a novel kind of topologically structured pulses, toroidal electromagnetic pulses (Nat. Photonics 16(7), 523-528 (2022); Sci. Adv. 10(2), eadl1803 (2024)). Excited by the recent compact generator of toroidal pulses and their sophisticated topological and nonseparable structures, we demonstrate that the space-time nonseparability and skyrmion topology inherent in toroidal pulses can be harnessed to achieve freespace microwave 3D positioning with super-resolution accuracy, reaching the centimeter level, using a single emitting antenna. This work opens up new avenues for exploring the potential applications of topological electromagnetic pulses including but not limited to positioning, imaging, and sensing technologies.
△ Less
Submitted 17 May, 2024; v1 submitted 6 May, 2024;
originally announced May 2024.
-
MoE-FFD: Mixture of Experts for Generalized and Parameter-Efficient Face Forgery Detection
Authors:
Chenqi Kong,
Anwei Luo,
Peijun Bao,
Yi Yu,
Haoliang Li,
Zengwei Zheng,
Shiqi Wang,
Alex C. Kot
Abstract:
Deepfakes have recently raised significant trust issues and security concerns among the public. Compared to CNN face forgery detectors, ViT-based methods take advantage of the expressivity of transformers, achieving superior detection performance. However, these approaches still exhibit the following limitations: (1) Fully fine-tuning ViT-based models from ImageNet weights demands substantial comp…
▽ More
Deepfakes have recently raised significant trust issues and security concerns among the public. Compared to CNN face forgery detectors, ViT-based methods take advantage of the expressivity of transformers, achieving superior detection performance. However, these approaches still exhibit the following limitations: (1) Fully fine-tuning ViT-based models from ImageNet weights demands substantial computational and storage resources; (2) ViT-based methods struggle to capture local forgery clues, leading to model bias; (3) These methods limit their scope on only one or few face forgery features, resulting in limited generalizability. To tackle these challenges, this work introduces Mixture-of-Experts modules for Face Forgery Detection (MoE-FFD), a generalized yet parameter-efficient ViT-based approach. MoE-FFD only updates lightweight Low-Rank Adaptation (LoRA) and Adapter layers while keeping the ViT backbone frozen, thereby achieving parameter-efficient training. Moreover, MoE-FFD leverages the expressivity of transformers and local priors of CNNs to simultaneously extract global and local forgery clues. Additionally, novel MoE modules are designed to scale the model's capacity and smartly select optimal forgery experts, further enhancing forgery detection performance. Our proposed learning scheme can be seamlessly adapted to various transformer backbones in a plug-and-play manner. Extensive experimental results demonstrate that the proposed method achieves state-of-the-art face forgery detection performance with significantly reduced parameter overhead. The code is released at: https://github.com/LoveSiameseCat/MoE-FFD.
△ Less
Submitted 7 June, 2024; v1 submitted 12 April, 2024;
originally announced April 2024.
-
Proving correctness for SQL implementations of OCL constraints
Authors:
Hoang Nguyen Phuoc Bao,
Manuel Clavel
Abstract:
In the context of the model-driven development of data-centric applications, OCL constraints play a major role in adding precision to the source models (e.g., data models and security models). Several code-generators have been proposed to bridge the gap between source models with OCL constraints and their corresponding database implementations. However, the database queries produced by these code-…
▽ More
In the context of the model-driven development of data-centric applications, OCL constraints play a major role in adding precision to the source models (e.g., data models and security models). Several code-generators have been proposed to bridge the gap between source models with OCL constraints and their corresponding database implementations. However, the database queries produced by these code-generators are significantly less efficient -- from the point of view of execution-time performance -- than the implementations manually written by database experts. In this paper, we propose a different approach to bridge the gap between models with OCL constraints and their corresponding database implementations. In particular, we introduce a model-based methodology for proving the correctness of manually written SQL implementations of OCL constraints. This methodology is based on a novel mapping from a significant subset of the SQL language into many-sorted first-order logic. Moreover, by leveraging on an already existing mapping from the OCL language into many-sorted first-order logic, we can use SMT solvers to automatically prove the correctness of SQL implementations of OCL constraints. To illustrate and show the applicability of our approach, we include in the paper a number of non-trivial examples. Finally, we report on the status of a suite of tools supporting our approach.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
Multiple-Input Auto-Encoder Guided Feature Selection for IoT Intrusion Detection Systems
Authors:
Phai Vu Dinh,
Diep N. Nguyen,
Dinh Thai Hoang,
Quang Uy Nguyen,
Eryk Dutkiewicz,
Son Pham Bao
Abstract:
While intrusion detection systems (IDSs) benefit from the diversity and generalization of IoT data features, the data diversity (e.g., the heterogeneity and high dimensions of data) also makes it difficult to train effective machine learning models in IoT IDSs. This also leads to potentially redundant/noisy features that may decrease the accuracy of the detection engine in IDSs. This paper first i…
▽ More
While intrusion detection systems (IDSs) benefit from the diversity and generalization of IoT data features, the data diversity (e.g., the heterogeneity and high dimensions of data) also makes it difficult to train effective machine learning models in IoT IDSs. This also leads to potentially redundant/noisy features that may decrease the accuracy of the detection engine in IDSs. This paper first introduces a novel neural network architecture called Multiple-Input Auto-Encoder (MIAE). MIAE consists of multiple sub-encoders that can process inputs from different sources with different characteristics. The MIAE model is trained in an unsupervised learning mode to transform the heterogeneous inputs into lower-dimensional representation, which helps classifiers distinguish between normal behaviour and different types of attacks. To distil and retain more relevant features but remove less important/redundant ones during the training process, we further design and embed a feature selection layer right after the representation layer of MIAE resulting in a new model called MIAEFS. This layer learns the importance of features in the representation vector, facilitating the selection of informative features from the representation vector. The results on three IDS datasets, i.e., NSLKDD, UNSW-NB15, and IDS2017, show the superior performance of MIAE and MIAEFS compared to other methods, e.g., conventional classifiers, dimensionality reduction models, unsupervised representation learning methods with different input dimensions, and unsupervised feature selection models. Moreover, MIAE and MIAEFS combined with the Random Forest (RF) classifier achieve accuracy of 96.5% in detecting sophisticated attacks, e.g., Slowloris. The average running time for detecting an attack sample using RF with the representation of MIAE and MIAEFS is approximate 1.7E-6 seconds, whilst the model size is lower than 1 MB.
△ Less
Submitted 21 March, 2024;
originally announced March 2024.
-
Benchmarking reconstructive spectrometer with multi-resonant cavities
Authors:
Chunhui Yao,
Kangning Xu,
Tianhua Lin,
Jie Ma,
Chumeng Yao,
Peng Bao,
Zhitian Shi,
Richard Penty,
Qixiang Cheng
Abstract:
Recent years have seen the rapid development of miniaturized reconstructive spectrometers (RSs), yet they still confront a range of technical challenges, such as bandwidth/resolution ratio, sensing speed, and/or power efficiency. Reported RS designs often suffer from insufficient decorrelation between sampling channels, which results in limited compressive sampling efficiency, in essence, due to i…
▽ More
Recent years have seen the rapid development of miniaturized reconstructive spectrometers (RSs), yet they still confront a range of technical challenges, such as bandwidth/resolution ratio, sensing speed, and/or power efficiency. Reported RS designs often suffer from insufficient decorrelation between sampling channels, which results in limited compressive sampling efficiency, in essence, due to inadequate engineering of sampling responses. This in turn leads to poor spectral-pixel-to-channel ratios (SPCRs), typically restricted at single digits. So far, there lacks a general guideline for manipulating RS sampling responses for the effectiveness of spectral information acquisition. In this study, we shed light on a fundamental parameter from the compressive sensing theory - the average mutual correlation coefficient v - and provide insight into how it serves as a critical benchmark in RS design with regards to the SPCR and reconstruction accuracy. To this end, we propose a novel RS design with multi-resonant cavities, consisting of a series of partial reflective interfaces. Such multi-cavity configuration offers an expansive parameter space, facilitating the superlative optimization of sampling matrices with minimized v. As a proof-of-concept demonstration, a single-shot, dual-band RS is implemented on a SiN platform, tailored for capturing signature spectral shapes across different wavelength regions, with customized photonic crystal nanobeam mirrors. Experimentally, the device demonstrates an overall operation bandwidth of 270 nm and a <0.5 nm resolution with only 15 sampling channels per band, leading to a record high SPCR of 18.0. Moreover, the proposed multi-cavity design can be readily adapted to various photonic platforms. For instance, we showcase that by employing multi-layer coatings, an ultra-broadband RS can be optimized to exhibit a 700 nm bandwidth with an SPCR of over 100.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Ultrafast Excited-State Energy Transfer in Phenylene Ethynylene Dendrimer: Quantum Dynamics with Tensor Network Method
Authors:
Sisi Liu,
Jiawei Peng,
Peng Bao,
Qiang Shi,
Zhenggang Lan
Abstract:
Photo-induced excited-state energy transfer (EET) processes play an important role in the solar energy conversions. The phenylene ethynylene (PE) dendrimers display great potential in improving the efficiency of solar cells, because of their excellent photo-harvesting and exciton-transport properties. In this work, we investigated the intramolecular EET dynamics in a dendrimer composed of two line…
▽ More
Photo-induced excited-state energy transfer (EET) processes play an important role in the solar energy conversions. The phenylene ethynylene (PE) dendrimers display great potential in improving the efficiency of solar cells, because of their excellent photo-harvesting and exciton-transport properties. In this work, we investigated the intramolecular EET dynamics in a dendrimer composed of two linear PE units (2-ring and 3-ring) using the full quantum dynamics based on the tensor network method. We first constructed a diabatic model Hamiltonian based on the electronic structure calculations. Using this diabatic vibronic coupling model, we tried to obtain the main features of the EET dynamics in terms of the several diabatic models with different numbers of vibrational modes (from 4 modes to 129 modes) and to explore the corresponding vibronic coupling interactions. The results show that the EET in the current PE dendrimer is an ultrafast process. Four modes with A' symmetry play dominant roles in the dynamics, other 86 modes with A' symmetry can damp the electronic coherence, and the modes of A" symmetry do not show the significant influence on the EET process. Overall, the first-order intrastate vibronic coupling terms show the dominant roles in the EET dynamics, while the second-order intrastate vibronic coupling terms give the visible impact here by damping the electronic coherence and slowing down the overall EET process. This work provides a valuable understanding of the physical insight in the EET dynamics of PE dendrimers.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Constrained Twin Variational Auto-Encoder for Intrusion Detection in IoT Systems
Authors:
Phai Vu Dinh,
Quang Uy Nguyen,
Dinh Thai Hoang,
Diep N. Nguyen,
Son Pham Bao,
Eryk Dutkiewicz
Abstract:
Intrusion detection systems (IDSs) play a critical role in protecting billions of IoT devices from malicious attacks. However, the IDSs for IoT devices face inherent challenges of IoT systems, including the heterogeneity of IoT data/devices, the high dimensionality of training data, and the imbalanced data. Moreover, the deployment of IDSs on IoT systems is challenging, and sometimes impossible, d…
▽ More
Intrusion detection systems (IDSs) play a critical role in protecting billions of IoT devices from malicious attacks. However, the IDSs for IoT devices face inherent challenges of IoT systems, including the heterogeneity of IoT data/devices, the high dimensionality of training data, and the imbalanced data. Moreover, the deployment of IDSs on IoT systems is challenging, and sometimes impossible, due to the limited resources such as memory/storage and computing capability of typical IoT devices. To tackle these challenges, this article proposes a novel deep neural network/architecture called Constrained Twin Variational Auto-Encoder (CTVAE) that can feed classifiers of IDSs with more separable/distinguishable and lower-dimensional representation data. Additionally, in comparison to the state-of-the-art neural networks used in IDSs, CTVAE requires less memory/storage and computing power, hence making it more suitable for IoT IDS systems. Extensive experiments with the 11 most popular IoT botnet datasets show that CTVAE can boost around 1% in terms of accuracy and Fscore in detection attack compared to the state-of-the-art machine learning and representation learning methods, whilst the running time for attack detection is lower than 2E-6 seconds and the model size is lower than 1 MB. We also further investigate various characteristics of CTVAE in the latent space and in the reconstruction representation to demonstrate its efficacy compared with current well-known methods.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review
Authors:
Mingze Yuan,
Peng Bao,
Jiajia Yuan,
Yunhao Shen,
Zifan Chen,
Yi Xie,
Jie Zhao,
Yang Chen,
Li Zhang,
Lin Shen,
Bin Dong
Abstract:
With the rapid development of artificial intelligence, large language models (LLMs) have shown promising capabilities in mimicking human-level language comprehension and reasoning. This has sparked significant interest in applying LLMs to enhance various aspects of healthcare, ranging from medical education to clinical decision support. However, medicine involves multifaceted data modalities and n…
▽ More
With the rapid development of artificial intelligence, large language models (LLMs) have shown promising capabilities in mimicking human-level language comprehension and reasoning. This has sparked significant interest in applying LLMs to enhance various aspects of healthcare, ranging from medical education to clinical decision support. However, medicine involves multifaceted data modalities and nuanced reasoning skills, presenting challenges for integrating LLMs. This paper provides a comprehensive review on the applications and implications of LLMs in medicine. It begins by examining the fundamental applications of general-purpose and specialized LLMs, demonstrating their utilities in knowledge retrieval, research support, clinical workflow automation, and diagnostic assistance. Recognizing the inherent multimodality of medicine, the review then focuses on multimodal LLMs, investigating their ability to process diverse data types like medical imaging and EHRs to augment diagnostic accuracy. To address LLMs' limitations regarding personalization and complex clinical reasoning, the paper explores the emerging development of LLM-powered autonomous agents for healthcare. Furthermore, it summarizes the evaluation methodologies for assessing LLMs' reliability and safety in medical contexts. Overall, this review offers an extensive analysis on the transformative potential of LLMs in modern medicine. It also highlights the pivotal need for continuous optimizations and ethical oversight before these models can be effectively integrated into clinical practice. Visit https://github.com/mingze-yuan/Awesome-LLM-Healthcare for an accompanying GitHub repository containing latest papers.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
Free-Space Propagation and Skyrmion Topology of Toroidal Electromagnetic Pulses
Authors:
Ren Wang,
Zhi-Qiang Hu,
Pan-Yi Bao,
Shuai Shi,
Bing-Zhong Wang,
Nikolay I. Zheludev,
Yijie Shen
Abstract:
Toroidal electromagnetic pulses have been recently reported as nontransverse, space-time nonseparable topological excitations of free space [Nat. Photon. 16, 523-528 (2022)]. However, their propagation dynamics and topological configurations have not been comprehensively experimentally characterized. Here, we report that microwave toroidal pulses can be launched by a broadband conical horn antenna…
▽ More
Toroidal electromagnetic pulses have been recently reported as nontransverse, space-time nonseparable topological excitations of free space [Nat. Photon. 16, 523-528 (2022)]. However, their propagation dynamics and topological configurations have not been comprehensively experimentally characterized. Here, we report that microwave toroidal pulses can be launched by a broadband conical horn antenna. We experimentally map their skyrmionic textures and demonstrate how that during propagation the pulses evolves towards stronger space-time nonseparability and closer proximity to the canonical Hellwarth and Nouchi toroidal pulses.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
Studies of Nonadiabatic Dynamics in the Singlet Fission Processes of Pentacene Dimer via Tensor Train Decomposition Method
Authors:
Jiawei Peng,
Deping Hu,
Hong Liu,
Qiang Shi,
Peng Bao,
Zhenggang Lan
Abstract:
Singlet fission (SF) is a very significant photophysical phenomenon and possesses potential applications. In this work, we try to give the rather detailed theoretical investigation of the SF process in the stacked polyacene dimer by combining the high-level quantum chemistry calculations, and the quantum dynamics simulations based on the tensor train decomposition method. Starting from the constru…
▽ More
Singlet fission (SF) is a very significant photophysical phenomenon and possesses potential applications. In this work, we try to give the rather detailed theoretical investigation of the SF process in the stacked polyacene dimer by combining the high-level quantum chemistry calculations, and the quantum dynamics simulations based on the tensor train decomposition method. Starting from the construction of the linear vibronic coupling model, we explore the pure electronic dynamics and the vibronic dynamics in the SF processes. The role of vibrational modes in nonadiabatic dynamics is addressed. The results show that the super-exchange mechanism mediated by the charge-transfer state is found in both pure electronic dynamics and the nonadiabatic dynamics. Particularly, the vibrational modes with the frequency resonance with the adiabatic energy gap play very import roles in the SF dynamics. This work not only provides a deep and detailed understanding of the SF process, but also verifies the efficiency of the tensor train decomposition method that can serve as the reference dynamics method to explore the dynamics behaviors of complex systems.
△ Less
Submitted 30 August, 2023;
originally announced August 2023.
-
Significant-attributed Community Search in Heterogeneous Information Networks
Authors:
Yanghao Liu,
Fangda Guo,
Bingbing Xu,
Peng Bao,
Huawei Shen,
Xueqi Cheng
Abstract:
Community search is a personalized community discovery problem aimed at finding densely-connected subgraphs containing the query vertex. In particular, the search for communities with high-importance vertices has recently received a great deal of attention. However, existing works mainly focus on conventional homogeneous networks where vertices are of the same type, but are not applicable to heter…
▽ More
Community search is a personalized community discovery problem aimed at finding densely-connected subgraphs containing the query vertex. In particular, the search for communities with high-importance vertices has recently received a great deal of attention. However, existing works mainly focus on conventional homogeneous networks where vertices are of the same type, but are not applicable to heterogeneous information networks (HINs) composed of multi-typed vertices and different semantic relations, such as bibliographic networks. In this paper, we study the problem of high-importance community search in HINs. A novel community model is introduced, named heterogeneous significant community (HSC), to unravel the closely connected vertices of the same type with high attribute values through multiple semantic relationships. An HSC not only maximizes the exploration of indirect relationships across entities of the anchor-type but incorporates their significance. To search the HSCs, we first develop online algorithms by exploiting both segmented-based meta-path expansion and significance increment. Specially, a solution space reuse strategy based on structural nesting is designed to boost the efficiency. In addition, we further devise a two-level index to support searching HSCs in optimal time, based on which a space-efficient compact index is proposed. Extensive experiments on real-world large-scale HINs demonstrate that our solutions are effective and efficient for searching HSCs, and the index-based algorithms are 2-4 orders of magnitude faster than online algorithms.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
Deep Reinforcement Learning for Beam Angle Optimization of Intensity-Modulated Radiation Therapy
Authors:
Peng Bao,
Gong Wang,
Ruijie Yang,
Bin Dong
Abstract:
Objective: Intensity-modulated radiation therapy (IMRT) beam angle optimization (BAO) is a challenging combinatorial optimization problem that is NP-hard. In this study, we aim to develop a personalized BAO algorithm for IMRT that improves the quality of the final treatment. Methods: To improve the quality of IMRT treatment planning, we propose a deep reinforcement learning (DRL)-based approach fo…
▽ More
Objective: Intensity-modulated radiation therapy (IMRT) beam angle optimization (BAO) is a challenging combinatorial optimization problem that is NP-hard. In this study, we aim to develop a personalized BAO algorithm for IMRT that improves the quality of the final treatment. Methods: To improve the quality of IMRT treatment planning, we propose a deep reinforcement learning (DRL)-based approach for IMRT BAO. We consider the task as a sequential decision-making problem and formulate it as a Markov Decision Process. To facilitate the training process, a 3D-Unet is designed to predict the dose distribution for the different number of beam angles, ranging from 1 to 9, to simulate the IMRT environment. By leveraging the simulation model, double deep-Q network (DDQN) and proximal policy optimization (PPO) are used to train agents to select the personalized beam angle sequentially within a few seconds. Results: The treatment plans with beam angles selected by DRL outperform those with clinically used evenly distributed beam angles. For DDQN, the overall average improvement of the CIs is 0.027, 0.032, and 0.03 for 5, 7, and 9 beam angles respectively. For PPO, the overall average improvement of CIs is 0.045, 0.051, and 0.025 for 5, 7, and 9 beam angles respectively. Conclusion: The proposed DRL-based beam angle selection strategy can generate personalized beam angles within a few seconds, and the resulting treatment plan is superior to that obtained using evenly distributed angles. Significance: A fast and automated personalized beam angle selection approach is been proposed for IMRT BAO.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Learning Sample Importance for Cross-Scenario Video Temporal Grounding
Authors:
Peijun Bao,
Yadong Mu
Abstract:
The task of temporal grounding aims to locate video moment in an untrimmed video, with a given sentence query. This paper for the first time investigates some superficial biases that are specific to the temporal grounding task, and proposes a novel targeted solution. Most alarmingly, we observe that existing temporal ground models heavily rely on some biases (e.g., high preference on frequent conc…
▽ More
The task of temporal grounding aims to locate video moment in an untrimmed video, with a given sentence query. This paper for the first time investigates some superficial biases that are specific to the temporal grounding task, and proposes a novel targeted solution. Most alarmingly, we observe that existing temporal ground models heavily rely on some biases (e.g., high preference on frequent concepts or certain temporal intervals) in the visual modal. This leads to inferior performance when generalizing the model in cross-scenario test setting. To this end, we propose a novel method called Debiased Temporal Language Localizer (DebiasTLL) to prevent the model from naively memorizing the biases and enforce it to ground the query sentence based on true inter-modal relationship. Debias-TLL simultaneously trains two models. By our design, a large discrepancy of these two models' predictions when judging a sample reveals higher probability of being a biased sample. Harnessing the informative discrepancy, we devise a data re-weighing scheme for mitigating the data biases. We evaluate the proposed model in cross-scenario temporal grounding, where the train / test data are heterogeneously sourced. Experiments show large-margin superiority of the proposed method in comparison with state-of-the-art competitors.
△ Less
Submitted 8 January, 2022;
originally announced January 2022.
-
Lifelong Vehicle Trajectory Prediction Framework Based on Generative Replay
Authors:
Peng Bao,
Zonghai Chen,
Jikai Wang,
Deyun Dai,
Hao Zhao
Abstract:
Accurate trajectory prediction of vehicles is essential for reliable autonomous driving. To maintain consistent performance as a vehicle driving around different cities, it is crucial to adapt to changing traffic circumstances and achieve lifelong trajectory prediction model. To realize it, catastrophic forgetting is a main problem to be addressed. In this paper, a divergence measurement method ba…
▽ More
Accurate trajectory prediction of vehicles is essential for reliable autonomous driving. To maintain consistent performance as a vehicle driving around different cities, it is crucial to adapt to changing traffic circumstances and achieve lifelong trajectory prediction model. To realize it, catastrophic forgetting is a main problem to be addressed. In this paper, a divergence measurement method based on conditional Kullback-Leibler divergence is proposed first to evaluate spatiotemporal dependency difference among varied driving circumstances. Then based on generative replay, a novel lifelong vehicle trajectory prediction framework is developed. The framework consists of a conditional generation model and a vehicle trajectory prediction model. The conditional generation model is a generative adversarial network conditioned on position configuration of vehicles. After learning and merging trajectory distribution of vehicles across different cities, the generation model replays trajectories with prior samplings as inputs, which alleviates catastrophic forgetting. The vehicle trajectory prediction model is trained by the replayed trajectories and achieves consistent prediction performance on visited cities. A lifelong experiment setup is established on four open datasets including five tasks. Spatiotemporal dependency divergence is calculated for different tasks. Even though these divergence, the proposed framework exhibits lifelong learning ability and achieves consistent performance on all tasks.
△ Less
Submitted 14 November, 2021;
originally announced November 2021.
-
Secure Transmission with Different Security Requirements Based on Covert Communication and Information-Theoretic Security in Presence of Friendly Jammer
Authors:
Pooya Baee,
Farid Samsami khodadad,
Moslem Forouzesh
Abstract:
In this paper, we investigate joint information-theoretic security and covert communication on a network in the presence of a single transmitter (Alice), a friendly jammer, a single untrusted user, two legitimate users, and a single warden of the channel (Willie). In the considered network, one of the authorized users, Bob, needs a secure and covert communication, and therefore his message must be…
▽ More
In this paper, we investigate joint information-theoretic security and covert communication on a network in the presence of a single transmitter (Alice), a friendly jammer, a single untrusted user, two legitimate users, and a single warden of the channel (Willie). In the considered network, one of the authorized users, Bob, needs a secure and covert communication, and therefore his message must be sent securely, and at the same time, the existence of his communication with the transmitter should not be detected by the channel's warden, Willie, Meanwhile, another authorized user, Carol, needs covert communication. The purpose of secure communication is to prevent the message being decoded by the untrusted user who is present on the network, which leads us to use one of the physical layer security methods, named the secure transmission of information theory. In some cases, in addition to protecting the content of the message, it is important for the user that the existence of the transmission not being detected by an adversary, which leads us to covert communication. In the proposed network model, it is assumed that for covert communication requirements, Alice will not send any messages to legitimate users in one time slot and in another time slot will send to them both (Bob and Carol). One of the main challenges in covert communication is low transmission rate, because we have to reduce the transmission power such that the main message get hide in background noise.
△ Less
Submitted 1 July, 2021;
originally announced July 2021.
-
Recent developments in the PySCF program package
Authors:
Qiming Sun,
Xing Zhang,
Samragni Banerjee,
Peng Bao,
Marc Barbry,
Nick S. Blunt,
Nikolay A. Bogdanov,
George H. Booth,
Jia Chen,
Zhi-Hao Cui,
Janus Juul Eriksen,
Yang Gao,
Sheng Guo,
Jan Hermann,
Matthew R. Hermes,
Kevin Koh,
Peter Koval,
Susi Lehtola,
Zhendong Li,
Junzi Liu,
Narbe Mardirossian,
James D. McClain,
Mario Motta,
Bastien Mussard,
Hung Q. Pham
, et al. (24 additional authors not shown)
Abstract:
PYSCF is a Python-based general-purpose electronic structure platform that both supports first-principles simulations of molecules and solids, as well as accelerates the development of new methodology and complex computational workflows. The present paper explains the design and philosophy behind PYSCF that enables it to meet these twin objectives. With several case studies, we show how users can…
▽ More
PYSCF is a Python-based general-purpose electronic structure platform that both supports first-principles simulations of molecules and solids, as well as accelerates the development of new methodology and complex computational workflows. The present paper explains the design and philosophy behind PYSCF that enables it to meet these twin objectives. With several case studies, we show how users can easily implement their own methods using PYSCF as a development environment. We then summarize the capabilities of PYSCF for molecular and solid-state simulations. Finally, we describe the growing ecosystem of projects that use PYSCF across the domains of quantum chemistry, materials science, machine learning and quantum information science.
△ Less
Submitted 10 July, 2020; v1 submitted 27 February, 2020;
originally announced February 2020.
-
Diabatic States,Couplings and Potential Energy Surfaces through the Block Localized Excitation Method
Authors:
Peng Bao,
Qiang Shi,
Jiali Gao
Abstract:
We propose a new block localized excitation (BLE) method to directly construct diabatic excited states without the need to first compute the adiabatic states. The new method is capable to keep any electrons, spins, and excitations localized in any divided blocks of intermolecular and intramolecular systems. At the same time, the electrostatic, exchange, and polarization interactions between differ…
▽ More
We propose a new block localized excitation (BLE) method to directly construct diabatic excited states without the need to first compute the adiabatic states. The new method is capable to keep any electrons, spins, and excitations localized in any divided blocks of intermolecular and intramolecular systems. At the same time, the electrostatic, exchange, and polarization interactions between different blocks can be fully taken account of. To achieve this, a new delta-SCF project method and the maximum wavefunction overlap method are employed to obtain localized excited states with orbitals relaxation, and the coupling between them are obtained using approaches similar to the multistate DFT (MSDFT) method. Numerical results show that the new BLE method is accurate in calculating the electronic couplings of the singlet excitation energy transfer (SEET) and triplet energy excitation transfer (TEET) processes, as well as the excited-state intermolecular potential energy surface.
△ Less
Submitted 18 January, 2020;
originally announced January 2020.
-
MD-Recon-Net: A Parallel Dual-Domain Convolutional Neural Network for Compressed Sensing MRI
Authors:
Maosong Ran,
Wenjun Xia,
Yongqiang Huang,
Zexin Lu,
Peng Bao,
Yan Liu,
Huaiqiang Sun,
Jiliu Zhou,
Yi Zhang
Abstract:
Compressed sensing magnetic resonance imaging (CS-MRI) is a theoretical framework that can accurately reconstruct images from undersampled k-space data with a much lower sampling rate than the one set by the classical Nyquist-Shannon sampling theorem. Therefore, CS-MRI can efficiently accelerate acquisition time and relieve the psychological burden on patients while maintaining high imaging qualit…
▽ More
Compressed sensing magnetic resonance imaging (CS-MRI) is a theoretical framework that can accurately reconstruct images from undersampled k-space data with a much lower sampling rate than the one set by the classical Nyquist-Shannon sampling theorem. Therefore, CS-MRI can efficiently accelerate acquisition time and relieve the psychological burden on patients while maintaining high imaging quality. The problems with traditional CS-MRI reconstruction are solved by iterative numerical solvers, which usually suffer from expensive computational cost and the lack of accurate handcrafted priori. In this paper, inspired by deep learning's (DL's) fast inference and excellent end-to-end performance, we propose a novel cascaded convolutional neural network called MD-Recon-Net to facilitate fast and accurate MRI reconstruction. Especially, different from existing DL-based methods, which operate on single domain data or both domains in a certain order, our proposed MD-Recon-Net contains two parallel and interactive branches that simultaneously perform on k-space and spatial-domain data, exploring the latent relationship between k-space and the spatial domain. The simulated experimental results show that the proposed method not only achieves competitive visual effects to several state-of-the-art methods, but also outperforms other DL-based methods in terms of model scale and computational cost.
△ Less
Submitted 18 May, 2020; v1 submitted 23 October, 2019;
originally announced October 2019.
-
Human Vocal Sentiment Analysis
Authors:
Andrew Huang,
Puwei Bao
Abstract:
In this paper, we use several techniques with conventional vocal feature extraction (MFCC, STFT), along with deep-learning approaches such as CNN, and also context-level analysis, by providing the textual data, and combining different approaches for improved emotion-level classification. We explore models that have not been tested to gauge the difference in performance and accuracy. We apply hyper…
▽ More
In this paper, we use several techniques with conventional vocal feature extraction (MFCC, STFT), along with deep-learning approaches such as CNN, and also context-level analysis, by providing the textual data, and combining different approaches for improved emotion-level classification. We explore models that have not been tested to gauge the difference in performance and accuracy. We apply hyperparameter sweeps and data augmentation to improve performance. Finally, we see if a real-time approach is feasible, and can be readily integrated into existing systems.
△ Less
Submitted 19 May, 2019;
originally announced May 2019.
-
Convolutional Sparse Coding for Compressed Sensing CT Reconstruction
Authors:
Peng Bao,
Wenjun Xia,
Kang Yang,
Weiyan Chen,
Mianyi Chen,
Yan Xi,
Shanzhou Niu,
Jiliu Zhou,
He Zhang,
Huaiqiang Sun,
Zhangyang Wang,
Yi Zhang
Abstract:
Over the past few years, dictionary learning (DL)-based methods have been successfully used in various image reconstruction problems. However, traditional DL-based computed tomography (CT) reconstruction methods are patch-based and ignore the consistency of pixels in overlapped patches. In addition, the features learned by these methods always contain shifted versions of the same features. In rece…
▽ More
Over the past few years, dictionary learning (DL)-based methods have been successfully used in various image reconstruction problems. However, traditional DL-based computed tomography (CT) reconstruction methods are patch-based and ignore the consistency of pixels in overlapped patches. In addition, the features learned by these methods always contain shifted versions of the same features. In recent years, convolutional sparse coding (CSC) has been developed to address these problems. In this paper, inspired by several successful applications of CSC in the field of signal processing, we explore the potential of CSC in sparse-view CT reconstruction. By directly working on the whole image, without the necessity of dividing the image into overlapped patches in DL-based methods, the proposed methods can maintain more details and avoid artifacts caused by patch aggregation. With predetermined filters, an alternating scheme is developed to optimize the objective function. Extensive experiments with simulated and real CT data were performed to validate the effectiveness of the proposed methods. Qualitative and quantitative results demonstrate that the proposed methods achieve better performance than several existing state-of-the-art methods.
△ Less
Submitted 20 March, 2019;
originally announced March 2019.
-
Sparse-View CT Reconstruction via Convolutional Sparse Coding
Authors:
Peng Bao,
Wenjun Xia,
Kang Yang,
Jiliu Zhou,
Yi Zhang
Abstract:
Traditional dictionary learning based CT reconstruction methods are patch-based and the features learned with these methods often contain shifted versions of the same features. To deal with these problems, the convolutional sparse coding (CSC) has been proposed and introduced into various applications. In this paper, inspired by the successful applications of CSC in the field of signal processing,…
▽ More
Traditional dictionary learning based CT reconstruction methods are patch-based and the features learned with these methods often contain shifted versions of the same features. To deal with these problems, the convolutional sparse coding (CSC) has been proposed and introduced into various applications. In this paper, inspired by the successful applications of CSC in the field of signal processing, we propose a novel sparse-view CT reconstruction method based on CSC with gradient regularization on feature maps. By directly working on whole image, which need not to divide the image into overlapped patches like dictionary learning based methods, the proposed method can maintain more details and avoid the artifacts caused by patch aggregation. Experimental results demonstrate that the proposed method has better performance than several existing algorithms in both qualitative and quantitative aspects.
△ Less
Submitted 15 October, 2018;
originally announced October 2018.
-
Few-View CT Reconstruction with Group-Sparsity Regularization
Authors:
Peng Bao,
Jiliu Zhou,
Yi Zhang
Abstract:
Classical total variation (TV) based iterative reconstruction algorithms assume that the signal is piecewise smooth, which causes reconstruction results to suffer from the over-smoothing effect. To address this problem, this work presents a novel computed tomography (CT) reconstruction method for the few-view problem called the group-sparsity regularization-based simultaneous algebraic reconstruct…
▽ More
Classical total variation (TV) based iterative reconstruction algorithms assume that the signal is piecewise smooth, which causes reconstruction results to suffer from the over-smoothing effect. To address this problem, this work presents a novel computed tomography (CT) reconstruction method for the few-view problem called the group-sparsity regularization-based simultaneous algebraic reconstruction technique (GSR-SART). Group-based sparse representation, which utilizes the concept of a group as the basic unit of sparse representation instead of a patch, is introduced as the image domain prior regularization term to eliminate the over-smoothing effect. By grouping the nonlocal patches into different clusters with similarity measured by Euclidean distance, the sparsity and nonlocal similarity in a single image are simultaneously explored. The split Bregman iteration algorithm is applied to obtain the numerical scheme. Experimental results demonstrate that our method both qualitatively and quantitatively outperforms several existing reconstruction methods, including filtered back projection, expectation maximization, SART, and TV-based projections onto convex sets.
△ Less
Submitted 5 March, 2018;
originally announced March 2018.
-
ASMCNN: An Efficient Brain Extraction Using Active Shape Model and Convolutional Neural Networks
Authors:
Duy H. M. Nguyen,
Duy M. Nguyen,
Mai T. N. Truong,
Thu Nguyen,
Khanh T. Tran,
Nguyen A. Triet,
Pham T. Bao,
Binh T. Nguyen
Abstract:
Brain extraction (skull stripping) is a challenging problem in neuroimaging. It is due to the variability in conditions from data acquisition or abnormalities in images, making brain morphology and intensity characteristics changeable and complicated. In this paper, we propose an algorithm for skull stripping in Magnetic Resonance Imaging (MRI) scans, namely ASMCNN, by combining the Active Shape M…
▽ More
Brain extraction (skull stripping) is a challenging problem in neuroimaging. It is due to the variability in conditions from data acquisition or abnormalities in images, making brain morphology and intensity characteristics changeable and complicated. In this paper, we propose an algorithm for skull stripping in Magnetic Resonance Imaging (MRI) scans, namely ASMCNN, by combining the Active Shape Model (ASM) and Convolutional Neural Network (CNN) for taking full of their advantages to achieve remarkable results. Instead of working with 3D structures, we process 2D image sequences in the sagittal plane. First, we divide images into different groups such that, in each group, shapes and structures of brain boundaries have similar appearances. Second, a modified version of ASM is used to detect brain boundaries by utilizing prior knowledge of each group. Finally, CNN and post-processing methods, including Conditional Random Field (CRF), Gaussian processes, and several special rules are applied to refine the segmentation contours. Experimental results show that our proposed method outperforms current state-of-the-art algorithms by a significant margin in all experiments.
△ Less
Submitted 27 January, 2022; v1 submitted 5 February, 2018;
originally announced February 2018.
-
Complex Matrix Factorization for Face Recognition
Authors:
Viet-Hang Duong,
Yuan-Shan Lee,
Bach-Tung Pham,
Seksan Mathulaprangsan,
Pham The Bao,
Jia-Ching Wang
Abstract:
This work developed novel complex matrix factorization methods for face recognition; the methods were complex matrix factorization (CMF), sparse complex matrix factorization (SpaCMF), and graph complex matrix factorization (GraCMF). After real-valued data are transformed into a complex field, the complex-valued matrix will be decomposed into two matrices of bases and coefficients, which are derive…
▽ More
This work developed novel complex matrix factorization methods for face recognition; the methods were complex matrix factorization (CMF), sparse complex matrix factorization (SpaCMF), and graph complex matrix factorization (GraCMF). After real-valued data are transformed into a complex field, the complex-valued matrix will be decomposed into two matrices of bases and coefficients, which are derived from solutions to an optimization problem in a complex domain. The generated objective function is the real-valued function of the reconstruction error, which produces a parametric description. Factorizing the matrix of complex entries directly transformed the constrained optimization problem into an unconstrained optimization problem. Additionally, a complex vector space with N dimensions can be regarded as a 2N-dimensional real vector space. Accordingly, all real analytic properties can be exploited in the complex field. The ability to exploit these important characteristics motivated the development herein of a simpler framework that can provide better recognition results. The effectiveness of this framework will be clearly elucidated in later sections in this paper.
△ Less
Submitted 7 December, 2016;
originally announced December 2016.
-
Intrinsic Ferromagnetism in the Diluted Magnetic Semiconductor Co:TiO$_2$
Authors:
H. Saadaoui,
X. Luo,
Z. Salman,
X. Y. Cui,
N. N. Bao,
P. Bao,
R. K. Zheng,
L. Tseng,
Y. H. Du,
T. Prokscha,
A. Suter,
T. Liu,
Y. R. Wang,
S. Li,
J. Ding,
S. P. Ringer,
E. Morenzoni,
J. B. Yi
Abstract:
Here we present a study of magnetism in \CTO\ anatase films grown by pulsed laser deposition under a variety of oxygen partial pressures and deposition rates. Energy-dispersive spectrometry and transition electron microscopy analyses indicate that a high deposition rate leads to a homogeneous microstructure, while very low rate or postannealing results in cobalt clustering. Depth resolved low-ener…
▽ More
Here we present a study of magnetism in \CTO\ anatase films grown by pulsed laser deposition under a variety of oxygen partial pressures and deposition rates. Energy-dispersive spectrometry and transition electron microscopy analyses indicate that a high deposition rate leads to a homogeneous microstructure, while very low rate or postannealing results in cobalt clustering. Depth resolved low-energy muon spin rotation experiments show that films grown at a low oxygen partial pressure ($\approx 10^{-6}$ torr) with a uniform structure are fully magnetic, indicating intrinsic ferromagnetism. First principles calculations identify the beneficial role of low oxygen partial pressure in the realization of uniform carrier-mediated ferromagnetism. This work demonstrates that Co:TiO$_2$ is an intrinsic diluted magnetic semiconductor.
△ Less
Submitted 7 December, 2016;
originally announced December 2016.
-
An Analysis of Tournament Structure
Authors:
Nhien Pham Hoang Bao,
Hiroyuki Iida
Abstract:
This paper explores a novel way for analyzing the tournament structures to find a best suitable one for the tournament under consideration. It concerns about three aspects such as tournament conducting cost, competitiveness development and ranking precision. It then proposes a new method using progress tree to detect potential throwaway matches. The analysis performed using the proposed method rev…
▽ More
This paper explores a novel way for analyzing the tournament structures to find a best suitable one for the tournament under consideration. It concerns about three aspects such as tournament conducting cost, competitiveness development and ranking precision. It then proposes a new method using progress tree to detect potential throwaway matches. The analysis performed using the proposed method reveals the strengths and weaknesses of tournament structures. As a conclusion, single elimination is best if we want to qualify one winner only, all matches conducted are exciting in term of competitiveness. Double elimination with proper seeding system is a better choice if we want to qualify more winners. A reasonable number of extra matches need to be conducted in exchange of being able to qualify top four winners. Round-robin gives reliable ranking precision for all participants. However, its conduction cost is very high, and it fails to maintain competitiveness development.
△ Less
Submitted 16 November, 2016;
originally announced November 2016.
-
Beta-decay study of $T_z=-2$ proton-rich nucleus $^{20}$Mg
Authors:
L. J. Sun,
X. X. Xu,
C. J. Lin,
J. S. Wang,
D. Q. Fang,
Z. H. Li,
Y. T. Wang,
J. Li,
L. Yang,
N. R. Ma,
K. Wang,
H. L. Zang,
H. W. Wang,
C. Li,
C. Z. Shi,
M. W. Nie,
X. F. Li,
H. Li,
J. B. Ma,
P. Ma,
S. L. Jin,
M. R. Huang,
Z. Bai,
J. G. Wang,
F. Yang
, et al. (10 additional authors not shown)
Abstract:
The $β$ decay of the drip-line nucleus $^{20}$Mg gives important information on resonances in $^{20}$Na, which are relevant for the astrophysical $rp$-process. A detailed $β$ decay spectroscopic study of $^{20}$Mg was performed by a continuous-implantation method. A detection system was specially developed for charged-particle decay studies, giving improved spectroscopic information including the…
▽ More
The $β$ decay of the drip-line nucleus $^{20}$Mg gives important information on resonances in $^{20}$Na, which are relevant for the astrophysical $rp$-process. A detailed $β$ decay spectroscopic study of $^{20}$Mg was performed by a continuous-implantation method. A detection system was specially developed for charged-particle decay studies, giving improved spectroscopic information including the half-life of $^{20}$Mg, the excitation energies, the branching ratios, and the log $ft$ values for the states in $^{20}$Na populated in the $β$ decay of $^{20}$Mg. A new proton branch was observed and the corresponding excited state in $^{20}$Na was proposed. The large isospin asymmetry for the mirror decays of $^{20}$Mg and $^{20}$O was reproduced, as well. However, no conclusive conclusion can be draw about the astrophysically interesting 2645~keV resonance in $^{20}$Na due to the limited statistics.
△ Less
Submitted 10 October, 2016;
originally announced October 2016.
-
Modeling and Predicting Popularity Dynamics of Microblogs using Self-Excited Hawkes Processes
Authors:
Peng Bao,
Hua-Wei Shen,
Xiaolong Jin,
Xue-Qi Cheng
Abstract:
The ability to model and predict the popularity dynamics of individual user generated items on online media has important implications in a wide range of areas. In this paper, we propose a probabilistic model using a Self-Excited Hawkes Process(SEHP) to characterize the process through which individual microblogs gain their popularity. This model explicitly captures the triggering effect of each f…
▽ More
The ability to model and predict the popularity dynamics of individual user generated items on online media has important implications in a wide range of areas. In this paper, we propose a probabilistic model using a Self-Excited Hawkes Process(SEHP) to characterize the process through which individual microblogs gain their popularity. This model explicitly captures the triggering effect of each forwarding, distinguishing itself from the reinforced Poisson process based model where all previous forwardings are simply aggregated as a single triggering effect. We validate the proposed model by applying it on Sina Weibo, the most popular microblogging network in China. Experimental results demonstrate that the SEHP model consistently outperforms the model based on reinforced Poisson process.
△ Less
Submitted 9 March, 2015;
originally announced March 2015.
-
Prediction of "Forwarding Whom" Behavior in Information Diffusion
Authors:
Peng Bao,
Hua-Wei Shen,
Xue-Qi Cheng
Abstract:
Follow-ship network among users underlies the diffusion dynamics of messages on online social networks. Generally, the structure of underlying social network determines the visibility of messages and the diffusion process. In this paper, we study forwarding behavior of individuals, taking Sina Weibo as an example. We investigate multiple exposures in information diffusion and the "forwarding whom"…
▽ More
Follow-ship network among users underlies the diffusion dynamics of messages on online social networks. Generally, the structure of underlying social network determines the visibility of messages and the diffusion process. In this paper, we study forwarding behavior of individuals, taking Sina Weibo as an example. We investigate multiple exposures in information diffusion and the "forwarding whom" problem associated with multiple exposures. Finally, we model and predict the "forwarding whom" behavior of individuals, combining structural, temporal, historical, and content features. Experimental results demonstrate that our method achieves a high accuracy 91.3%.
△ Less
Submitted 7 November, 2016; v1 submitted 27 October, 2014;
originally announced October 2014.
-
On the roles of graphene oxide doping for enhanced supercurrent in MgB2 based superconductors
Authors:
W. K. Yeoh,
X. Y. Cui,
B. Gault,
K. S. B. De Silva,
X. Xu,
H. W. Liu,
H. W. Yen,
D. Wong,
P. Bao,
D. J. Larson,
I. Martin,
W. X. Li,
R. K. Zheng,
X. L. Wang,
S. X. Dou,
S. P. Ringer
Abstract:
Due to their graphene-like properties after oxygen reduction, incorporation of graphene oxide (GO) sheets into correlated-electron materials offers a new pathway for tailoring their properties. Fabricating GO nanocomposites with polycrystalline MgB2 superconductors leads to an order of magnitude enhancement of the supercurrent at 5 K/8 T and 20 K/4 T. Herein, we introduce a novel experimental appr…
▽ More
Due to their graphene-like properties after oxygen reduction, incorporation of graphene oxide (GO) sheets into correlated-electron materials offers a new pathway for tailoring their properties. Fabricating GO nanocomposites with polycrystalline MgB2 superconductors leads to an order of magnitude enhancement of the supercurrent at 5 K/8 T and 20 K/4 T. Herein, we introduce a novel experimental approach to overcome the formidable challenge of performing quantitative microscopy and microanalysis of such composites, so as to unveil how GO doping influences the structure and hence the material properties. Atom probe microscopy and electron microscopy were used to directly image the GO within the MgB2, and we combined these data with computational simulations to derive the property-enhancing mechanisms. Our results reveal synergetic effects of GO, namely, via localized atomic (carbon and oxygen) doping as well as texturing of the crystals, which provide both inter and intra granular flux pinning. This study opens up new insights into how low-dimensional nanostructures can be integrated into composites to modify the overall properties, using a methodology amenable to a wide range of applications.
△ Less
Submitted 19 May, 2014;
originally announced May 2014.
-
Development of large-area quadrant silicon detector for charged particles
Authors:
Pengfei Bao,
Chengjian Lin,
Feng Yang,
Zhaoqiao Guo,
Tianshu Guo,
Lei Yang,
Lijie Sun,
Huiming Jia,
Xinxing Xu,
Nanru Ma,
Huanqiao Zhang,
Zuhua Liu
Abstract:
The quadrant silicon detector, a kind of passivated implanted planar silicon detector with quadrant structure on the junction side, gained its wide application in charged particle detection. In this paper, the manufacturing procedure, performance test and results of the quadrant silicon detector developed recently at the China Institute of Atomic Energy are presented. The detector is about 300…
▽ More
The quadrant silicon detector, a kind of passivated implanted planar silicon detector with quadrant structure on the junction side, gained its wide application in charged particle detection. In this paper, the manufacturing procedure, performance test and results of the quadrant silicon detector developed recently at the China Institute of Atomic Energy are presented. The detector is about 300 $μ$m thick with a 48$\times$48 mm$^{2}$ active area. The leakage current under full depletion bias voltage of -16 V is about 2.5 nA, and the raising time is better than 160 ns. The energy resolution for 5.157 MeV $α$-particle is around the level of $1\%$. Charge sharing effects between the neighboring quads, leading to complicated correlations between two quads, were observed when $α$ particles illuminated on the junction side. It is explained as a result of distortion of electric field of inter-quad region. Such events is only about $0.6\%$ of all events and can be neglected in an actual application.
△ Less
Submitted 27 January, 2014;
originally announced January 2014.
-
Cumulative Effect in Information Diffusion: A Comprehensive Empirical Study on Microblogging Network
Authors:
Peng Bao,
Hua-Wei Shen,
Wei Chen,
Xue-Qi Cheng
Abstract:
Cumulative effect in social contagions underlies many studies on the spread of innovation, behaviors, and influence. However, few large-scale empirical studies are conducted to validate the existence of cumulative effect in the information diffusion on social networks. In this paper, using the population-scale dataset from the largest Chinese microblogging website, we conduct a comprehensive study…
▽ More
Cumulative effect in social contagions underlies many studies on the spread of innovation, behaviors, and influence. However, few large-scale empirical studies are conducted to validate the existence of cumulative effect in the information diffusion on social networks. In this paper, using the population-scale dataset from the largest Chinese microblogging website, we conduct a comprehensive study on the cumulative effect in information diffusion. We base our study on the diffusion network of each message, where nodes are the involved users and links are the following relationships among them. We find that multiple exposures to the same message indeed increase the possibility of forwarding it. However, additional exposures cannot further improve the chance of forwarding when the number of exposures crosses its peak at two. This finding questions the cumulative effect hypothesis in information diffusion. Furthermore, to clarify the forwarding preference among users, we investigate both the structural motif of the diffusion network and the temporal pattern of information diffusion process among users. The patterns provide vital insight for understanding the variation of message popularity and explain the characteristics of diffusion networks.
△ Less
Submitted 2 June, 2013;
originally announced June 2013.
-
Popularity Prediction in Microblogging Network: A Case Study on Sina Weibo
Authors:
Peng Bao,
Hua-Wei Shen,
Junming Huang,
Xueqi Cheng
Abstract:
Predicting the popularity of content is important for both the host and users of social media sites. The challenge of this problem comes from the inequality of the popularity of con- tent. Existing methods for popularity prediction are mainly based on the quality of content, the interface of social media site to highlight contents, and the collective behavior of user- s. However, little attention…
▽ More
Predicting the popularity of content is important for both the host and users of social media sites. The challenge of this problem comes from the inequality of the popularity of con- tent. Existing methods for popularity prediction are mainly based on the quality of content, the interface of social media site to highlight contents, and the collective behavior of user- s. However, little attention is paid to the structural charac- teristics of the networks spanned by early adopters, i.e., the users who view or forward the content in the early stage of content dissemination. In this paper, taking the Sina Weibo as a case, we empirically study whether structural character- istics can provide clues for the popularity of short messages. We find that the popularity of content is well reflected by the structural diversity of the early adopters. Experimental results demonstrate that the prediction accuracy is signif- icantly improved by incorporating the factor of structural diversity into existing methods.
△ Less
Submitted 15 April, 2013;
originally announced April 2013.
-
Co-development of significant elastic and reversible plastic deformation in nanowires
Authors:
Peite Bao,
Yanbo Wang,
Xiangyuan Cui,
Qiang Gao,
Hongwei Liu,
Wai Kong Yeoh,
Hung-Wei Yen,
Xiaozhou Liao,
Sichao Du,
H. Hoe Tan,
Chennupati Jagadish,
Jin Zou,
Simon P. Ringer,
Rongkun Zheng
Abstract:
When a material is subjected to an applied stress, the material will experience recoverable elastic deformation followed by permanent plastic deformation at the point when the applied stress exceeds the yield stress of the material. Microscopically, the onset of the plasticity usually indicates the activation of dislocation motion, which is considered to be the primary mechanism of plastic deforma…
▽ More
When a material is subjected to an applied stress, the material will experience recoverable elastic deformation followed by permanent plastic deformation at the point when the applied stress exceeds the yield stress of the material. Microscopically, the onset of the plasticity usually indicates the activation of dislocation motion, which is considered to be the primary mechanism of plastic deformation. Once plastic deformation is initiated, further elastic deformation is negligible owing to the limited increase in the flow stress caused by work hardening. Here we present experimental evidence and quantitative analysis of simultaneous development of significant elastic deformation and dislocation-based plastic deformation in single crystal GaAs nanowires (NWs) under bending deformation up to a total strain of ~ 6%. The observation is in sharp contrast to the previous notions regarding the deformation modes. Most of the plastic deformation recovers spontaneously when the external stress is released, and therefore resembles an elastic deformation process.
△ Less
Submitted 12 March, 2013;
originally announced March 2013.
-
Intermediate Resolution Near-Infrared Spectroscopy of 36 late-M Dwarfs
Authors:
R. Deshpande,
E. L. Martín,
M. M. Montgomery,
M. R. Zapatero Osorio,
F. Rodler,
C. del Burgo,
N. Phan Bao,
Y. Lyubchik,
R. Tata,
H. Bouy,
Y. Pavlenko
Abstract:
We present observations of 36 late-M dwarfs obtained with the KeckII/NIRSPEC in the J-band at a resolution of \sim20,000. We have measured projected rotational velocities, absolute radial velocities, and pseudo-equivalent widths of atomic lines. 12 of our targets did not have previous measurements in the literature.
For the other 24 targets, we confirm previously reported measurements. We find t…
▽ More
We present observations of 36 late-M dwarfs obtained with the KeckII/NIRSPEC in the J-band at a resolution of \sim20,000. We have measured projected rotational velocities, absolute radial velocities, and pseudo-equivalent widths of atomic lines. 12 of our targets did not have previous measurements in the literature.
For the other 24 targets, we confirm previously reported measurements. We find that 13 stars from our sample have vsini below our measurement threshold (12 km/s) whereas four of our targets are fast rotators (vsini > 30 km/s). As fast rotation causes spectral features to be washed out, stars with low projected rotational velocities are sought for radial velocity surveys.
At our intermediate spectral resolution we have confirmed the identification of neutral atomic lines reported in Mclean et al. 2007. We also calculated pseudo-equivalent widths (p-EW) of 12 atomic lines. Our results confirm that the p-EW of K I lines are strongly dependent on spectral types. We observe that the p-EW of Fe I and Mn I lines remain fairly constant with later spectral type. We suggest that those lines are particularly suitable for deriving metallicities for late-M dwarfs.
△ Less
Submitted 11 July, 2012;
originally announced July 2012.