-
Integer Binary-Range Alignment Neuron for Spiking Neural Networks
Authors:
Binghao Ye,
Wenjuan Li,
Dong Wang,
Man Yao,
Bing Li,
Weiming Hu,
Dong Liang,
Kun Shang
Abstract:
Spiking Neural Networks (SNNs) are noted for their brain-like computation and energy efficiency, but their performance lags behind Artificial Neural Networks (ANNs) in tasks like image classification and object detection due to their limited representational capacity. To address this, we propose a novel spiking neuron, Integer Binary-Range Alignment Leaky Integrate-and-Fire, to exponentially expand the information expression capacity of spiking neurons with only a slight energy increase. This is achieved through Integer Binary Leaky Integrate-and-Fire and a range alignment strategy. The Integer Binary Leaky Integrate-and-Fire allows integer-valued activation during training and maintains spike-driven dynamics during inference through binary conversion, which expands virtual timesteps. The range alignment strategy is designed to solve the spike activation limitation problem, where neurons fail to activate high integer values. Experiments show our method outperforms previous SNNs, achieving 74.19% accuracy on ImageNet and 66.2% mAP@50 and 49.1% mAP@50:95 on COCO, surpassing previous bests with the same architecture by +3.45%, +1.6%, and +1.8%, respectively. Notably, our SNNs match or exceed ANNs' performance with the same architecture, and the energy efficiency is improved by 6.3$\times$.
Submitted 5 June, 2025;
originally announced June 2025.
-
Collaboration among Multiple Large Language Models for Medical Question Answering
Authors:
Kexin Shang,
Chia-Hsuan Chang,
Christopher C. Yang
Abstract:
Empowered by a vast internal knowledge reservoir, the new generation of large language models (LLMs) demonstrates untapped potential to tackle medical tasks. However, insufficient effort has been made towards eliciting a synergistic effect from multiple LLMs' expertise and background. In this study, we propose a multi-LLM collaboration framework tailored to a medical multiple-choice question dataset. Through post-hoc analysis on 3 pre-trained LLM participants, our framework is shown to boost the reasoning ability of all LLMs as well as alleviate their divergence among questions. We also measure an LLM's confidence when it is confronted with adversarial opinions from other LLMs and observe a concurrence between the LLM's confidence and prediction accuracy.
Submitted 22 May, 2025;
originally announced May 2025.
-
Dynamic Object Geographic Coordinate Recognition: An Attitude-Free and Reference-Free Framework via Intrinsic Linear Algebraic Structures
Authors:
Junfan Yi,
Ke-ke Shang,
Michael Small
Abstract:
The Earth, a temporal complex system, is witnessing a shift in research on its coordinate system, moving away from conventional static positioning toward dynamic modeling. Early positioning concentrated on static natural geographic features; with the emergence of geographic information systems, which introduced a growing demand for spatial data, the focus has turned to capturing dynamic objects. However, previous methods typically rely on expensive devices or external calibration objects for attitude measurement. We propose an applied mathematical model that utilizes time series, an inherent property of dynamic objects, to determine relative attitudes without absolute attitude measurements, then employs SVD-based methods for 3D coordinate recognition. The model is validated in a numerical simulation with negligible error, of the kind inherent in computer numerical approximations. Next, to assess our model in an engineering scenario, we propose a framework that integrates applied mathematics with AI, using only three cameras to capture a UAV. We enhance the YOLOv8 model by leveraging time series for accurate 2D coordinate acquisition, whose output is then used as input for 2D-to-3D conversion via our mathematical model. As a result, the framework demonstrates high precision, as evidenced by low error metrics including root mean square error, mean absolute error, and maximum error, and a strong R-squared value. It is important to note that the mathematical method itself is inherently error-free; any observed inaccuracies are due solely to external hardware or the AI-based 2D coordinate acquisition process, which represents an improved version of the current state-of-the-art. Our framework enriches geodetic theory by providing a streamlined model for the 3D positioning of non-cooperative targets, minimizing input attitude parameters and leveraging applied mathematics and AI.
Submitted 12 May, 2025;
originally announced May 2025.
-
Triadic Closure-Heterogeneity-Harmony GCN for Link Prediction
Authors:
Ke-ke Shang,
Junfan Yi,
Michael Small,
Yijie Zhou
Abstract:
Link prediction aims to estimate the likelihood of connections between pairs of nodes in complex networks, which is beneficial to many applications from friend recommendation to metabolic network reconstruction. Traditional heuristic-based methodologies in the field of complex networks typically depend on predefined assumptions about node connectivity, limiting their generalizability across diverse networks. While recent graph neural network (GNN) approaches capture global structural features effectively, they often neglect node attributes and intrinsic structural relationships between node pairs. To address this, we propose TriHetGCN, an extension of traditional Graph Convolutional Networks (GCNs) that incorporates explicit topological indicators -- triadic closure and degree heterogeneity. TriHetGCN consists of three modules: topology feature construction, graph structural representation, and connection probability prediction. The topology feature module constructs node features using shortest path distances to anchor nodes, enhancing global structure perception. The graph structural module integrates topological indicators into the GCN framework to model triadic closure and heterogeneity. The connection probability module uses deep learning to predict links. Evaluated on nine real-world datasets, from traditional networks without node attributes to large-scale networks with rich features, TriHetGCN achieves state-of-the-art performance, outperforming mainstream methods. This highlights its strong generalization across diverse network types, offering a promising framework that bridges statistical physics and graph deep learning.
Submitted 29 April, 2025;
originally announced April 2025.
-
Sensitivity of the CUPID experiment to $0νββ$ decay of $^{100}$Mo
Authors:
K. Alfonso,
A. Armatol,
C. Augier,
F. T. Avignone III,
O. Azzolini,
A. S. Barabash,
G. Bari,
A. Barresi,
D. Baudin,
F. Bellini,
G. Benato,
L. Benussi,
V. Berest,
M. Beretta,
L. Bergé,
M. Bettelli,
M. Biassoni,
J. Billard,
F. Boffelli,
V. Boldrini,
E. D. Brandani,
C. Brofferio,
C. Bucci,
M. Buchynska,
J. Camilleri,
et al. (167 additional authors not shown)
Abstract:
CUPID is a next-generation bolometric experiment to search for neutrinoless double-beta decay ($0νββ$) of $^{100}$Mo using Li$_2$MoO$_4$ scintillating crystals. It will operate 1596 crystals at $\sim$10 mK in the CUORE cryostat at the Laboratori Nazionali del Gran Sasso in Italy. Each crystal will be facing two Ge-based bolometric light detectors for $α$ rejection. We compute the discovery and the exclusion sensitivity of CUPID to $0νββ$ in a Frequentist and a Bayesian framework. This computation is done numerically based on pseudo-experiments. For the CUPID baseline scenario, with a background and an energy resolution of $1.0 \times 10^{-4}$ counts/keV/kg/yr and 5 keV FWHM at the Q-value, respectively, this results in a Bayesian exclusion sensitivity (90% c.i.) of $\hat{T}_{1/2} > 1.6^{+0.6}_{-0.5} \times 10^{27} \ \mathrm{yr}$, corresponding to the effective Majorana neutrino mass of $\hat{m}_{ββ} < \ 9.6$ -- $16.3 \ \mathrm{meV}$. The Frequentist discovery sensitivity (3$σ$) is $\hat{T}_{1/2}= 1.0 \times 10^{27} \ \mathrm{yr}$, corresponding to $\hat{m}_{ββ}= \ 12.2$ -- $20.6 \ \mathrm{meV}$.
Submitted 19 April, 2025;
originally announced April 2025.
-
Machine Learning Informed by Micro and Mesoscopic Statistical Physics Methods for Community Detection
Authors:
Yijun Ran,
Junfan Yi,
Wei Si,
Michael Small,
Ke-ke Shang
Abstract:
Community detection plays a crucial role in understanding the structural organization of complex networks. Previous methods, particularly those from statistical physics, primarily focus on the analysis of mesoscopic network structures and often struggle to integrate fine-grained node similarities. To address this limitation, we propose a low-complexity framework that integrates machine learning to embed micro-level node-pair similarities into mesoscopic community structures. By leveraging ensemble learning models, our approach enhances both structural coherence and detection accuracy. Experimental evaluations on artificial and real-world networks demonstrate that our framework consistently outperforms conventional methods, achieving higher modularity and improved accuracy in NMI and ARI. Notably, when ground-truth labels are available, our approach yields the most accurate detection results, effectively recovering real-world community structures while minimizing misclassifications. To further explain our framework's performance, we analyze the correlation between node-pair similarity and evaluation metrics. The results reveal a strong and statistically significant correlation, underscoring the critical role of node-pair similarity in enhancing detection accuracy. Overall, our findings highlight the synergy between machine learning and statistical physics, demonstrating how machine learning techniques can enhance network analysis and uncover complex structural patterns.
Submitted 18 April, 2025;
originally announced April 2025.
-
Innovating Bolometers' Mounting: A Gravity-Based Approach
Authors:
The CUPID Collaboration,
K. Alfonso,
A. Armatol,
C. Augier,
F. T. Avignone III,
O. Azzolini,
A. S. Barabash,
G. Bari,
A. Barresi,
D. Baudin,
F. Bellini,
G. Benato,
L. Benussi,
V. Berest,
M. Beretta,
M. Bettelli,
M. Biassoni,
J. Billard,
F. Boffelli,
V. Boldrini,
E. D. Brandani,
C. Brofferio,
C. Bucci,
M. Buchynska,
J. Camilleri,
et al. (168 additional authors not shown)
Abstract:
Cryogenic calorimeters, also known as bolometers, are among the leading technologies for searching for rare events. The CUPID experiment is exploiting this technology to deploy a tonne-scale detector to search for neutrinoless double-beta decay of $^{100}$Mo. The CUPID collaboration proposed an innovative approach to assembling bolometers in a stacked configuration, held in position solely by gravity. This gravity-based assembly method is unprecedented in the field of bolometers and offers several advantages, including relaxed mechanical tolerances and simplified construction. To assess and optimize its performance, we constructed a medium-scale prototype hosting 28 Li$_2$MoO$_4$ crystals and 30 Ge light detectors, both operated as cryogenic calorimeters at the Laboratori Nazionali del Gran Sasso (Italy). Despite an unexpected excess of noise in the light detectors, the results of this test proved (i) a thermal stability better than $\pm$0.5 mK at 10 mK, (ii) a good energy resolution of Li$_2$MoO$_4$ bolometers, (6.6 $\pm$ 2.2) keV FWHM at 2615 keV, and (iii) a Li$_2$MoO$_4$ light yield measured by the closest light detector of 0.36 keV/MeV, sufficient to guarantee the particle identification requested by CUPID.
Submitted 6 March, 2025;
originally announced March 2025.
-
CUPID, the CUORE Upgrade with Particle Identification
Authors:
The CUPID Collaboration,
K. Alfonso,
A. Armatol,
C. Augier,
F. T. Avignone III,
O. Azzolini,
A. S. Barabash,
G. Bari,
A. Barresi,
D. Baudin,
F. Bellini,
G. Benato,
L. Benussi,
V. Berest,
M. Beretta,
M. Bettelli,
M. Biassoni,
J. Billard,
F. Boffelli,
V. Boldrini,
E. D. Brandani,
C. Brofferio,
C. Bucci,
M. Buchynska,
J. Camilleri,
et al. (166 additional authors not shown)
Abstract:
CUPID, the CUORE Upgrade with Particle Identification, is a next-generation experiment to search for neutrinoless double beta decay ($0νββ$) and other rare events using enriched Li$_2$$^{100}$MoO$_4$ scintillating bolometers. It will be hosted by the CUORE cryostat located at the Laboratori Nazionali del Gran Sasso in Italy. The main physics goal of CUPID is to search for $0νββ$ of $^{100}$Mo with a discovery sensitivity covering the full neutrino mass regime in the inverted ordering scenario, as well as the portion of the normal ordering regime with lightest neutrino mass larger than 10 meV. With a conservative background index of 10$^{-4}$ cnts/(keV$\cdot$kg$\cdot$yr), 240 kg isotope mass, 5 keV FWHM energy resolution and 10 live-years of data taking, CUPID will have a 90% C.L. half-life exclusion sensitivity of 1.8 $\cdot$ 10$^{27}$ yr, corresponding to an effective Majorana neutrino mass ($m_{ββ}$) sensitivity of 9--15 meV, and a $3σ$ discovery sensitivity of 1 $\cdot$ 10$^{27}$ yr, corresponding to an $m_{ββ}$ range of 12--21 meV.
Submitted 1 March, 2025;
originally announced March 2025.
-
Interactive Gadolinium-Free MRI Synthesis: A Transformer with Localization Prompt Learning
Authors:
Linhao Li,
Changhui Su,
Yu Guo,
Huimao Zhang,
Dong Liang,
Kun Shang
Abstract:
Contrast-enhanced magnetic resonance imaging (CE-MRI) is crucial for tumor detection and diagnosis, but the use of gadolinium-based contrast agents (GBCAs) in clinical settings raises safety concerns due to potential health risks. To circumvent these issues while preserving diagnostic accuracy, we propose a novel Transformer with Localization Prompts (TLP) framework for synthesizing CE-MRI from non-contrast MR images. Our architecture introduces three key innovations: a hierarchical backbone that uses an efficient Transformer to process multi-scale features; a multi-stage fusion system consisting of Local and Global Fusion modules that hierarchically integrate complementary information via spatial attention operations and cross-attention mechanisms, respectively; and a Fuzzy Prompt Generation (FPG) module that enhances the TLP model's generalization by emulating radiologists' manual annotation through stochastic feature perturbation. The framework uniquely enables interactive clinical integration by allowing radiologists to input diagnostic prompts during inference, synergizing artificial intelligence with medical expertise. This research establishes a new paradigm for contrast-free MRI synthesis while addressing critical clinical needs for safer diagnostic procedures. Code is available at https://github.com/ChanghuiSu/TLP.
Submitted 3 March, 2025;
originally announced March 2025.
-
Towards energy-insensitive and robust neutron/gamma classification: A learning-based frequency-domain parametric approach
Authors:
Pengcheng Ai,
Hongtao Qin,
Xiangming Sun,
Kaiwen Shang
Abstract:
Neutron/gamma discrimination has been intensively researched in recent years, due to its unique scientific value and widespread applications. With the advancement of detection materials and algorithms, we can now achieve fairly good discrimination. However, further improvements rely on better utilization of detector raw signals, especially energy-independent pulse characteristics. We begin by discussing why figure-of-merit (FoM) is not a comprehensive criterion for high-precision neutron/gamma discriminators, and by proposing a new evaluation method based on adversarial sampling. Inspired by frequency-domain analysis in the existing literature, we build parametric linear/nonlinear models of minimal complexity on the discrete spectrum, with tunable parameters as in neural networks. We train the models on an open-source neutron/gamma dataset (CLYC crystals with silicon photomultipliers) preprocessed by charge normalization to discover and exploit energy-independent features. The performance is evaluated at different sampling rates and noise levels, in comparison with the frequency classification index and conventional methods. The frequency-domain parametric models show higher accuracy and better adaptability to variations of data integrity than other discriminators. The proposed method is also promising for online inference on economical hardware and portable devices.
Submitted 26 May, 2025; v1 submitted 11 February, 2025;
originally announced February 2025.
-
InfoBFR: Real-World Blind Face Restoration via Information Bottleneck
Authors:
Nan Gao,
Jia Li,
Huaibo Huang,
Ke Shang,
Ran He
Abstract:
Blind face restoration (BFR) is a highly challenging problem due to the uncertainty of data degradation patterns. Current BFR methods produce restored results, but with inherent neural degradations that limit real-world generalization in complicated scenarios. In this paper, we propose a plug-and-play framework, InfoBFR, to tackle neural degradations, e.g., prior bias, topological distortion, textural distortion, and artifact residues, which achieves high-generalization face restoration in diverse wild and heterogeneous scenes. Specifically, based on the results from pre-trained BFR models, InfoBFR considers information compression using manifold information bottleneck (MIB) and information compensation with efficient diffusion LoRA to conduct information optimization. InfoBFR effectively synthesizes high-fidelity faces without attribute and identity distortions. Comprehensive experimental results demonstrate the superiority of InfoBFR over state-of-the-art GAN-based and diffusion-based BFR methods, with around 70ms consumption, 16M trainable parameters, and nearly 85% BFR-boosting. InfoBFR shows promise as the first plug-and-play restorer that can be universally employed by diverse BFR models to conquer neural degradations.
Submitted 26 January, 2025;
originally announced January 2025.
-
CCxTrust: Confidential Computing Platform Based on TEE and TPM Collaborative Trust
Authors:
Ketong Shang,
Jiangnan Lin,
Yu Qin,
Muyan Shen,
Hongzhan Ma,
Wei Feng,
Dengguo Feng
Abstract:
Confidential Computing has emerged to address data security challenges in cloud-centric deployments by protecting data in use through hardware-level isolation. However, reliance on a single hardware root of trust (RoT) limits user confidence in cloud platforms, especially for high-performance AI services, where end-to-end protection of sensitive models and data is critical. Furthermore, the lack of interoperability and a unified trust model in multi-cloud environments prevents the establishment of a cross-platform, cross-cloud chain of trust, creating a significant trust gap for users with high privacy requirements. To address the challenges mentioned above, this paper proposes CCxTrust (Confidential Computing with Trust), a confidential computing platform leveraging collaborative roots of trust from TEE and TPM. CCxTrust combines the black-box RoT embedded in the CPU-TEE with the flexible white-box RoT of TPM to establish a collaborative trust framework. The platform implements independent Roots of Trust for Measurement (RTM) for TEE and TPM, and a collaborative Root of Trust for Report (RTR) for composite attestation. The Root of Trust for Storage (RTS) is solely supported by TPM. We also present the design and implementation of a confidential TPM supporting multiple modes for secure use within confidential virtual machines. Additionally, we propose a composite attestation protocol integrating TEE and TPM to enhance security and attestation efficiency, which is proven secure under the PCL protocol security model. We implemented a prototype of CCxTrust on a confidential computing server with AMD SEV-SNP and TPM chips, requiring minimal modifications to the TPM and guest Linux kernel. The composite attestation efficiency improved by 24% without significant overhead, while Confidential TPM performance showed a 16.47% reduction compared to standard TPM.
Submitted 11 December, 2024; v1 submitted 4 December, 2024;
originally announced December 2024.
-
Learning Pareto Set for Multi-Objective Continuous Robot Control
Authors:
Tianye Shu,
Ke Shang,
Cheng Gong,
Yang Nan,
Hisao Ishibuchi
Abstract:
For a control problem with multiple conflicting objectives, there exists a set of Pareto-optimal policies called the Pareto set instead of a single optimal policy. When a multi-objective control problem is continuous and complex, traditional multi-objective reinforcement learning (MORL) algorithms search for many Pareto-optimal deep policies to approximate the Pareto set, which is quite resource-consuming. In this paper, we propose a simple and resource-efficient MORL algorithm that learns a continuous representation of the Pareto set in a high-dimensional policy parameter space using a single hypernet. The learned hypernet can directly generate various well-trained policy networks for different user preferences. We compare our method with two state-of-the-art MORL algorithms on seven multi-objective continuous robot control problems. Experimental results show that our method achieves the best overall performance with the least training parameters. An interesting observation is that the Pareto set is well approximated by a curved line or surface in a high-dimensional parameter space. This observation will provide insight for researchers to design new MORL algorithms.
Submitted 27 June, 2024;
originally announced June 2024.
-
DiffMAC: Diffusion Manifold Hallucination Correction for High Generalization Blind Face Restoration
Authors:
Nan Gao,
Jia Li,
Huaibo Huang,
Zhi Zeng,
Ke Shang,
Shuwu Zhang,
Ran He
Abstract:
Blind face restoration (BFR) is a highly challenging problem due to the uncertainty of degradation patterns. Current methods have low generalization across photorealistic and heterogeneous domains. In this paper, we propose a Diffusion-Information-Diffusion (DID) framework to tackle diffusion manifold hallucination correction (DiffMAC), which achieves high-generalization face restoration in diverse degraded scenes and heterogeneous domains. Specifically, the first diffusion stage aligns the restored face with spatial feature embedding of the low-quality face based on AdaIN, which synthesizes degradation-removal results but with uncontrollable artifacts for some hard cases. Based on Stage I, Stage II considers information compression using manifold information bottleneck (MIB) and finetunes the first diffusion model to improve facial fidelity. DiffMAC effectively fights against blind degradation patterns and synthesizes high-quality faces with attribute and identity consistencies. Experimental results demonstrate the superiority of DiffMAC over state-of-the-art methods, with a high degree of generalization in real-world and heterogeneous settings. The source code and models will be public.
Submitted 15 March, 2024;
originally announced March 2024.
-
Radiation and Heat Transport in Divergent Shock-Bubble Interactions
Authors:
Kelin Kurzer-Ogul,
Brian M. Haines,
David S. Montgomery,
Silvia Pandolfi,
Joshua P. Sauppe,
Andrew F. T. Leong,
Daniel Hodge,
Pawel M. Kozlowski,
Stefano Marchesini,
Eric Cunningham,
Eric Galtier,
Dimitri Khaghani,
Hae Ja Lee,
Bob Nagler,
Richard L. Sandberg,
Arianna E. Gleason,
Hussein Aluie,
Jessica K. Shang
Abstract:
Shock-bubble interactions (SBI) are important across a wide range of physical systems. In inertial confinement fusion, interactions between laser-driven shocks and micro-voids in both ablators and foam targets generate instabilities that are a major obstacle in achieving ignition. Experiments imaging the collapse of such voids at high energy densities (HED) are constrained by spatial and temporal resolution, making simulations a vital tool in understanding these systems. In this study, we benchmark several radiation and thermal transport models in the xRAGE hydrodynamic code against experimental images of a collapsing mesoscale void during the passage of a 300 GPa shock. We also quantitatively examine the role of transport physics in the evolution of the SBI. This allows us to understand the dynamics of the interaction at timescales shorter than experimental imaging framerates. We find that all radiation models examined reproduce empirical shock velocities within experimental error. Radiation transport is found to reduce shock pressures by providing an additional energy pathway in the ablation region, but this effect is small ($\sim$1% of total shock pressure). Employing a flux-limited Spitzer model for heat conduction, we find that flux limiters between 0.03 and 0.10 produce agreement with experimental velocities, suggesting that the system is well within the Spitzer regime. Higher heat conduction is found to lower temperatures in the ablated plasma and to prevent secondary shocks at the ablation front, resulting in weaker primary shocks. Finally, we confirm that the SBI-driven instabilities observed in the HED regime are baroclinically driven, as in the low energy case.
Submitted 5 March, 2024;
originally announced March 2024.
-
Randomness-Efficient Constructions of Capacity-Achieving List-Decodable Codes
Authors:
Jonathan Mosheiff,
Nicolas Resch,
Kuo Shang,
Chen Yuan
Abstract:
We wish to generate list-decodable codes over small alphabets using as little randomness as possible. Specifically, we hope to generate codes achieving what we term the Elias bound, which means that they are $(ρ,L)$-list-decodable with rate $R \geq 1-h(ρ)-O(1/L)$. A long line of work shows that uniformly random linear codes (RLCs) achieve the Elias bound: hence, we know $O(n^2)$ random bits suffice. Prior works demonstrate that just $O(Ln)$ random bits suffice, via puncturing of low-bias codes. These recent constructions are combinatorial.
We provide two new constructions, which are algebraic. Compared to prior works, our constructions are simpler and more direct. Furthermore, our codes are designed in such a way that their duals are also quite easy to analyze. Our first construction -- which can be seen as a generalization of the Wozencraft ensemble -- achieves the Elias bound and consumes $Ln$ random bits. Additionally, its dual code achieves the GV-bound with high probability, and both the primal and dual admit quasilinear-time encoding algorithms. The second construction consumes $2nL$ random bits and yields a code where both it and its dual achieve the Elias bound. As we discuss, properties of a dual code are often crucial for applications in cryptography.
In all of the above cases -- including the prior works achieving randomness complexity $O(Ln)$ -- the codes are designed to "approximate" RLCs. Namely, for a given locality parameter $L$ we construct codes achieving the same $L$-local properties as RLCs. This allows one to appeal to known list-decodability results for RLCs and thereby conclude that the code approximating an RLC also achieves the Elias bound. As a final contribution, we indicate that such a proof strategy is inherently unable to generate list-decodable codes of rate $R$ over $\mathbb F_q$ with less than $L(1-R)n\log_2(q)$ bits of randomness.
Submitted 15 May, 2024; v1 submitted 18 February, 2024;
originally announced February 2024.
-
Realistic Restorer: artifact-free flow restorer(AF2R) for MRI motion artifact removal
Authors:
Jiandong Su,
Kun Shang,
Dong Liang
Abstract:
Motion artifact is a major challenge in magnetic resonance imaging (MRI) that severely degrades image quality, reduces examination efficiency, and makes accurate diagnosis difficult. However, previous methods often relied on implicit models for artifact correction, resulting in biases in modeling the artifact formation mechanism and characterizing the relationship between artifact information and anatomical details. These limitations have hindered the ability to obtain high-quality MR images. In this work, we incorporate the artifact generation mechanism to reestablish the relationship between artifacts and anatomical content in the image domain, highlighting the superiority of explicit models over implicit models in medical problems. Based on this, we propose a novel end-to-end image domain model called AF2R, which addresses this problem using conditional normalization flow. Specifically, we first design a feature encoder to extract anatomical features from images with motion artifacts. Then, through a series of reversible transformations using the feature-to-image flow module, we progressively obtain MR images unaffected by motion artifacts. Experimental results on simulated and real datasets demonstrate that our method achieves better performance in both quantitative and qualitative results, preserving better anatomical details.
Submitted 19 June, 2023;
originally announced June 2023.
-
RetinexFlow for CT metal artifact reduction
Authors:
Jiandong Su,
Ce Wang,
Yinsheng Li,
Kun Shang,
Dong Liang
Abstract:
Metal artifacts are a major challenge in computed tomography (CT) imaging, significantly degrading image quality and making accurate diagnosis difficult. However, previous methods either require prior knowledge of the location of metal implants or deviate from the actual mechanism of artifact formation, which limits their ability to obtain high-quality CT images. In this work, we formulate the metal artifact reduction problem as a combination of decomposition and completion tasks. We propose RetinexFlow, a novel end-to-end image-domain model based on Retinex theory and conditional normalizing flow, to solve it. Specifically, we first design a feature decomposition encoder that separates the metal implant component from the inherent component and extracts the inherent feature. Then, a feature-to-image flow module completes the metal artifact-free CT image step by step through a series of invertible transformations. These designs are incorporated in our model with a coarse-to-fine strategy, enabling it to achieve superior performance. Experimental results on simulated and clinical datasets show that our method achieves better quantitative and qualitative results, exhibiting better visual performance in artifact removal and image fidelity.
Submitted 18 June, 2023;
originally announced June 2023.
-
DDMM-Synth: A Denoising Diffusion Model for Cross-modal Medical Image Synthesis with Sparse-view Measurement Embedding
Authors:
Xiaoyue Li,
Kai Shang,
Gaoang Wang,
Mark D. Butala
Abstract:
Reducing the radiation dose in computed tomography (CT) is important to mitigate radiation-induced risks. One option is to employ a well-trained model to compensate for incomplete information and map sparse-view measurements to the CT reconstruction. However, reconstruction from sparsely sampled measurements is insufficient to uniquely characterize an object in CT, and a learned prior model may be inadequate for unencountered cases. Medical modal translation from magnetic resonance imaging (MRI) to CT is an alternative but may introduce incorrect information into the synthesized CT images in addition to the fact that there exists no explicit transformation describing their relationship. To address these issues, we propose a novel framework called the denoising diffusion model for medical image synthesis (DDMM-Synth) to close the performance gaps described above. This framework combines an MRI-guided diffusion model with a new CT measurement embedding reverse sampling scheme. Specifically, the null-space content of the one-step denoising result is refined by the MRI-guided data distribution prior, and its range-space component derived from an explicit operator matrix and the sparse-view CT measurements is directly integrated into the inference stage. DDMM-Synth can adjust the projection number of CT a posteriori for a particular clinical application and its modified version can even improve the results significantly for noisy cases. Our results show that DDMM-Synth outperforms other state-of-the-art supervised-learning-based baselines under fair experimental conditions.
Submitted 28 March, 2023;
originally announced March 2023.
-
EEG Opto-processor: epileptic seizure detection using diffractive photonic computing units
Authors:
Tao Yan,
Maoqi Zhang,
Sen Wan,
Kaifeng Shang,
Haiou Zhang,
Xun Cao,
Xing Lin,
Qionghai Dai
Abstract:
Electroencephalography (EEG) analysis extracts critical information from brain signals and provides fundamental support for various applications, including brain-disease diagnosis and brain-computer interfaces. However, real-time processing of large-scale EEG signals at high energy efficiency poses great challenges for electronic processors on edge computing devices. Here, we propose the EEG opto-processor based on diffractive photonic computing units (DPUs) to effectively process extracranial and intracranial EEG signals and perform epileptic seizure detection. The signals of EEG channels within a second-time window are optically encoded as inputs to the constructed diffractive neural networks for classification, which monitor the brain state to determine whether it indicates an epileptic seizure. We developed both free-space and integrated DPUs as edge computing systems and demonstrated their application to real-time epileptic seizure detection on benchmark datasets, i.e., the CHB-MIT extracranial EEG dataset and the Epilepsy-iEEG-Multicenter intracranial EEG dataset, at high computing performance. Along with the channel selection mechanism, both the numerical evaluations and the experimental results validated the sufficiently high classification accuracies of the proposed opto-processors for supervising clinical diagnosis. Our work opens up a new research direction of utilizing photonic computing techniques for processing large-scale EEG signals and promoting their broader applications.
Submitted 9 December, 2022;
originally announced January 2023.
-
Active CT Reconstruction with a Learned Sampling Policy
Authors:
Ce Wang,
Kun Shang,
Haimiao Zhang,
Shang Zhao,
Dong Liang,
S. Kevin Zhou
Abstract:
Computed tomography (CT) is a widely used imaging technology that assists clinical decision-making with high-quality representations of the human body. To reduce the radiation dose of CT, sparse-view and limited-angle CT have been developed while preserving image quality. However, these methods still rely on a fixed or uniform sampling strategy, which precludes acquiring a better image at an even lower dose. In this paper, we explore this possibility by learning an active sampling policy that optimizes the sampling positions for patient-specific, high-quality reconstruction. To this end, we design an intelligent agent that actively recommends sampling positions based on on-the-fly reconstruction from the sinograms obtained so far, in a progressive fashion. With this design, we achieve better performance on the NIH-AAPM dataset than popular uniform sampling, especially when the number of views is small. Finally, the design also enables RoI-aware reconstruction with improved quality within clinically important regions of interest (RoIs). Experiments on the VerSe dataset demonstrate this ability of our sampling policy, which is difficult to achieve with uniform sampling.
Submitted 3 November, 2022;
originally announced November 2022.
-
Effects of Archive Size on Computation Time and Solution Quality for Multi-Objective Optimization
Authors:
Tianye Shu,
Ke Shang,
Hisao Ishibuchi,
Yang Nan
Abstract:
An unbounded external archive has been used to store all nondominated solutions found by an evolutionary multi-objective optimization algorithm in some studies. It has been shown that a selected solution subset from the stored solutions is often better than the final population. However, the use of the unbounded archive is not always realistic. When the number of examined solutions is huge, we must pre-specify the archive size. In this study, we examine the effects of the archive size on three aspects: (i) the quality of the selected final solution set, (ii) the total computation time for the archive maintenance and the final solution set selection, and (iii) the required memory size. Unsurprisingly, the increase of the archive size improves the final solution set quality. Interestingly, the total computation time of a medium-size archive is much larger than that of a small-size archive and a huge-size archive (e.g., an unbounded archive). To decrease the computation time, we examine two ideas: periodical archive update and archiving only in later generations. Compared with updating the archive at every generation, the first idea can obtain almost the same final solution set quality using a much shorter computation time at the cost of a slight increase of the memory size. The second idea drastically decreases the computation time at the cost of a slight deterioration of the final solution set quality. Based on our experimental results, some suggestions are given about how to appropriately choose an archiving strategy and an archive size.
Submitted 2 November, 2022; v1 submitted 7 September, 2022;
originally announced September 2022.
-
Effective Drift Velocity from Turbulent Transport by Vorticity
Authors:
Hussein Aluie,
Shikhar Rai,
Hao Yin,
Aarne Lees,
Dongxiao Zhao,
Stephen M. Griffes,
Alistar Adcroft,
Jessica K. Shang
Abstract:
We highlight the differing roles of vorticity and strain in the transport of coarse-grained scalars at length-scales larger than $\ell$ by smaller scale (subscale) turbulence. We use the first term in a multiscale gradient expansion due to Eyink \cite{Eyink06a}, which exhibits excellent correlation with the exact subscale physics when the partitioning length $\ell$ is any scale smaller than that of the spectral peak. We show that unlike subscale strain, which acts as an anisotropic diffusion/anti-diffusion tensor, subscale vorticity's contribution is solely a conservative advection of coarse-grained quantities by an eddy-induced non-divergent velocity, $\mathbf{v}_*$, that is proportional to the curl of vorticity. Therefore, material (Lagrangian) advection of coarse-grained quantities is accomplished not by the coarse-grained flow velocity, $\overline{\mathbf{u}}_\ell$, but by the effective velocity, $\overline{\mathbf{u}}_\ell+\mathbf{v}_*$, the physics of which may improve commonly used LES models.
Submitted 6 September, 2022;
originally announced September 2022.
-
HV-Net: Hypervolume Approximation based on DeepSets
Authors:
Ke Shang,
Weiyu Chen,
Weiduo Liao,
Hisao Ishibuchi
Abstract:
In this letter, we propose HV-Net, a new method for hypervolume approximation in evolutionary multi-objective optimization. The basic idea of HV-Net is to use DeepSets, a deep neural network with permutation invariant property, to approximate the hypervolume of a non-dominated solution set. The input of HV-Net is a non-dominated solution set in the objective space, and the output is an approximated hypervolume value of this solution set. The performance of HV-Net is evaluated through computational experiments by comparing it with two commonly-used hypervolume approximation methods (i.e., point-based method and line-based method). Our experimental results show that HV-Net outperforms the other two methods in terms of both the approximation error and the runtime, which shows the potential of using deep learning technique for hypervolume approximation.
Submitted 4 March, 2022;
originally announced March 2022.
-
Learning to Approximate: Auto Direction Vector Set Generation for Hypervolume Contribution Approximation
Authors:
Ke Shang,
Tianye Shu,
Hisao Ishibuchi
Abstract:
Hypervolume contribution is an important concept in evolutionary multi-objective optimization (EMO). It is used in hypervolume-based EMO algorithms and hypervolume subset selection algorithms. Its main drawback is that it is computationally expensive in high-dimensional spaces, which limits its applicability to many-objective optimization. Recently, an R2 indicator variant (i.e., the $R_2^{\text{HVC}}$ indicator) was proposed to approximate the hypervolume contribution. The $R_2^{\text{HVC}}$ indicator uses line segments along a number of direction vectors for hypervolume contribution approximation. It has been shown that different direction vector sets lead to different approximation quality. In this paper, we propose Learning to Approximate (LtA), a direction vector set generation method for the $R_2^{\text{HVC}}$ indicator. The direction vector set is automatically learned from training data. The learned direction vector set can then be used in the $R_2^{\text{HVC}}$ indicator to improve its approximation quality. The usefulness of the proposed LtA method is examined by comparing it with other commonly used direction vector set generation methods for the $R_2^{\text{HVC}}$ indicator. Experimental results suggest the superiority of LtA over the other methods for generating high-quality direction vector sets.
Submitted 17 January, 2022;
originally announced January 2022.
-
Benchmarking Subset Selection from Large Candidate Solution Sets in Evolutionary Multi-objective Optimization
Authors:
Ke Shang,
Tianye Shu,
Hisao Ishibuchi,
Yang Nan,
Lie Meng Pang
Abstract:
In the evolutionary multi-objective optimization (EMO) field, the standard practice is to present the final population of an EMO algorithm as the output. However, it has been shown that the final population often includes solutions which are dominated by other solutions generated and discarded in previous generations. Recently, a new EMO framework has been proposed to solve this issue by storing all the non-dominated solutions generated during the evolution in an archive and selecting a subset of solutions from the archive as the output. The key component in this framework is the subset selection from the archive which usually stores a large number of candidate solutions. However, most studies on subset selection focus on small candidate solution sets for environmental selection. There is no benchmark test suite for large-scale subset selection. This paper aims to fill this research gap by proposing a benchmark test suite for subset selection from large candidate solution sets, and comparing some representative methods using the proposed test suite. The proposed test suite together with the benchmarking studies provides a baseline for researchers to understand, use, compare, and develop subset selection methods in the EMO field.
Submitted 29 March, 2022; v1 submitted 17 January, 2022;
originally announced January 2022.
-
DuDoTrans: Dual-Domain Transformer Provides More Attention for Sinogram Restoration in Sparse-View CT Reconstruction
Authors:
Ce Wang,
Kun Shang,
Haimiao Zhang,
Qian Li,
Yuan Hui,
S. Kevin Zhou
Abstract:
While Computed Tomography (CT) reconstruction from X-ray sinograms is necessary for clinical diagnosis, ionizing radiation in the imaging process induces irreversible injury, thereby driving researchers to study sparse-view CT reconstruction, that is, recovering a high-quality CT image from a sparse set of sinogram views. Iterative models have been proposed to alleviate the artifacts that appear in sparse-view CT images, but their computation cost is too high. Deep-learning-based methods have since gained prevalence due to their excellent performance and lower computation cost. However, these methods ignore the mismatch between the CNN's local feature extraction capability and the sinogram's global characteristics. To overcome this problem, we propose the Dual-Domain Transformer (DuDoTrans) to simultaneously restore informative sinograms via the long-range dependency modeling capability of the Transformer and reconstruct the CT image from both the enhanced and raw sinograms. With this novel design, reconstruction performance on the NIH-AAPM dataset and a COVID-19 dataset experimentally confirms the effectiveness and generalizability of DuDoTrans with fewer parameters. Extensive experiments also demonstrate its robustness under different noise levels for sparse-view CT reconstruction. The code and models are publicly available at https://github.com/DuDoTrans/CODE
Submitted 25 November, 2021; v1 submitted 21 November, 2021;
originally announced November 2021.
-
Adaptive Similarity Function with Structural Features of Network Embedding for Missing Link Prediction
Authors:
Chuanting Zhang,
Ke-ke Shang,
Jingping Qiao
Abstract:
Link prediction is a fundamental problem of data science, which usually calls for unfolding the mechanisms that govern the micro-dynamics of networks. In this regard, using features obtained from network embedding for predicting links has drawn widespread attention. Though edge features-based or node similarity-based methods have been proposed to solve the link prediction problem, many technical challenges still exist due to the unique structural properties of networks, especially when the networks are sparse. From the graph mining perspective, we first give empirical evidence of the inconsistency between heuristic and learned edge features. Then we propose a novel link prediction framework, AdaSim, by introducing an Adaptive Similarity function using features obtained from network embedding based on random walks. The node feature representations are obtained by optimizing a graph-based objective function. Instead of generating edge features using binary operators, we perform link prediction solely leveraging the node features of the network. We define a flexible similarity function with one tunable parameter, which serves as a penalty of the original similarity measure. The optimal value is learned through supervised learning thus is adaptive to data distribution. To evaluate the performance of our proposed algorithm, we conduct extensive experiments on eleven disparate networks of the real world. Experimental results show that AdaSim achieves better performance than state-of-the-art algorithms and is robust to different sparsities of the networks.
Submitted 12 November, 2021;
originally announced November 2021.
-
Clustering-Based Subset Selection in Evolutionary Multiobjective Optimization
Authors:
Weiyu Chen,
Hisao Ishibuchi,
Ke Shang
Abstract:
Subset selection is an important component in evolutionary multiobjective optimization (EMO) algorithms. Clustering, as a classic method to group similar data points together, has been used for subset selection in some fields. However, clustering-based methods have not been evaluated in the context of subset selection from solution sets obtained by EMO algorithms. In this paper, we first review some classic clustering algorithms. We also point out that another popular subset selection method, i.e., inverted generational distance (IGD)-based subset selection, can be viewed as clustering. Then, we perform a comprehensive experimental study to evaluate the performance of various clustering algorithms in different scenarios. Experimental results are analyzed in detail, and some suggestions about the use of clustering algorithms for subset selection are derived. Additionally, we demonstrate that decision maker's preference can be introduced to clustering-based subset selection.
Submitted 29 August, 2021; v1 submitted 18 August, 2021;
originally announced August 2021.
-
Scaling of Turbulent Viscosity and Resistivity: Extracting a Scale-dependent Turbulent Magnetic Prandtl Number
Authors:
Xin Bian,
Jessica K. Shang,
Eric G. Blackman,
Gilbert W. Collins,
Hussein Aluie
Abstract:
Turbulent viscosity $ν_t$ and resistivity $η_t$ are perhaps the simplest models for turbulent transport of angular momentum and magnetic fields, respectively. The associated turbulent magnetic Prandtl number $Pr_t\equiv ν_t/η_t$ has been well recognized to determine the final magnetic configuration of accretion disks. Here, we present an approach to determining these ''effective transport'' coefficients acting at different length-scales using coarse-graining and recent results on decoupled kinetic and magnetic energy cascades [Bian & Aluie 2019]. By analyzing the kinetic and magnetic energy cascades from a suite of high-resolution simulations, we show that our definitions of $ν_t$, $η_t$, and $Pr_t$ have power-law scalings in the ''decoupled range.'' We observe that $Pr_t\approx1 \text{~to~}2$ at the smallest inertial-inductive scales, increasing to $\approx 5$ at the largest scales. However, based on physical considerations, our analysis suggests that $Pr_t$ has to become scale-independent and of order unity in the decoupled range at sufficiently high Reynolds numbers (or grid-resolution), and that the power-law scaling exponents of velocity and magnetic spectra become equal. In addition to implications to astrophysical systems, the scale-dependent turbulent transport coefficients offer a guide for large eddy simulation modeling.
Submitted 2 July, 2021;
originally announced July 2021.
-
Hypervolume-Optimal $μ$-Distributions on Line/Plane-based Pareto Fronts in Three Dimensions
Authors:
Ke Shang,
Hisao Ishibuchi,
Weiyu Chen,
Yang Nan,
Weiduo Liao
Abstract:
Hypervolume is widely used in the evolutionary multi-objective optimization (EMO) field to evaluate the quality of a solution set. For a solution set with $μ$ solutions on a Pareto front, a larger hypervolume means a better solution set. Investigating the distribution of the solution set with the largest hypervolume is an important topic in EMO, which is the so-called hypervolume optimal $μ$-distribution. Theoretical results have shown that the $μ$ solutions are uniformly distributed on a linear Pareto front in two dimensions. However, the $μ$ solutions are not always uniformly distributed on a single-line Pareto front in three dimensions. They are only uniform when the single-line Pareto front has one constant objective. In this paper, we further investigate the hypervolume optimal $μ$-distribution in three dimensions. We consider the line- and plane-based Pareto fronts. For the line-based Pareto fronts, we extend the single-line Pareto front to two-line and three-line Pareto fronts, where each line has one constant objective. For the plane-based Pareto fronts, the linear triangular and inverted triangular Pareto fronts are considered. First, we show that the $μ$ solutions are not always uniformly distributed on the line-based Pareto fronts. The uniformity depends on how the lines are combined. Then, we show that a uniform solution set on the plane-based Pareto front is not always optimal for hypervolume maximization. It is locally optimal with respect to a $(μ+1)$ selection scheme. Our results can help researchers in the community to better understand and utilize the hypervolume indicator.
Submitted 19 April, 2021;
originally announced April 2021.
-
Improving Generalizability in Limited-Angle CT Reconstruction with Sinogram Extrapolation
Authors:
Ce Wang,
Haimiao Zhang,
Qian Li,
Kun Shang,
Yuanyuan Lyu,
Bin Dong,
S. Kevin Zhou
Abstract:
Computed tomography (CT) reconstruction from X-ray projections acquired within a limited angle range is challenging, especially when the angle range is extremely small. Both analytical and iterative models need more projections for effective modeling. Deep learning methods have gained prevalence due to their excellent reconstruction performance, but such success is mainly limited to the same dataset and does not generalize across datasets with different distributions. Here we propose ExtraPolationNetwork for limited-angle CT reconstruction via the introduction of a sinogram extrapolation module, which is theoretically justified. The module supplies extra sinogram information and boosts model generalizability. Extensive experimental results show that our reconstruction model achieves state-of-the-art performance on the NIH-AAPM dataset, similar to existing approaches. More importantly, we show that using such a sinogram extrapolation module significantly improves the generalization capability of the model on unseen datasets (e.g., COVID-19 and LIDC datasets) when compared to existing approaches.
Submitted 17 November, 2021; v1 submitted 9 March, 2021;
originally announced March 2021.
-
Fast Greedy Subset Selection from Large Candidate Solution Sets in Evolutionary Multi-objective Optimization
Authors:
Weiyu Chen,
Hisao Ishibuchi,
Ke Shang
Abstract:
Subset selection is an interesting and important topic in the field of evolutionary multi-objective optimization (EMO). Especially, in an EMO algorithm with an unbounded external archive, subset selection is an essential post-processing procedure to select a pre-specified number of solutions as the final result. In this paper, we discuss the efficiency of greedy subset selection for the hypervolume, IGD and IGD+ indicators. Greedy algorithms usually efficiently handle subset selection. However, when a large number of solutions are given (e.g., subset selection from tens of thousands of solutions in an unbounded external archive), they often become time-consuming. Our idea is to use the submodular property, which is known for the hypervolume indicator, to improve their efficiency. First, we prove that the IGD and IGD+ indicators are also submodular. Next, based on the submodular property, we propose an efficient greedy inclusion algorithm for each indicator. Then, we demonstrate through computational experiments that the proposed algorithms are much faster than the standard greedy subset selection algorithms.
Submitted 1 February, 2021;
originally announced February 2021.
-
Peristaltic pumping in sub-wavelength channels
Authors:
Jessica K Shang,
J Brennen Carr,
Caroline D Cardinale,
Delin Zeng
Abstract:
We apply the lubrication approximation to solve for the flow generated by a peristaltic traveling wave in a finite, planar channel, and examine the effect of channel length. Cerebrospinal fluid (CSF) is hypothesized to be peristaltically transported by arterial pulsations through the perivascular spaces in the brain. Previous studies of peristaltic perivascular models have chosen model lengths ranging from sub-wavelength, which is more physiologically realistic, to full wavelength. Here, we solve for peristaltic flow rates for arbitrary lengths, and find that sub-wavelength channels significantly modulate the mean value, phase, and amplitude of flow rate for sinusoidal and general peristaltic waveforms. The boundary conditions create an internal pressure gradient such that the instantaneous flow rate varies along the length of the channel, and the difference between the ends and the middle of the channel is more pronounced for very short channels. This longitudinal distribution in flow rate is not observed in vivo in perivascular spaces at the surface of the brain, and hence sub-wavelength peristaltic models whose boundary conditions are isolated from the larger perivascular network are limited in their ability to reproduce perivascular flows.
Submitted 14 June, 2021; v1 submitted 24 December, 2020;
originally announced December 2020.
-
Evolutionary Multi-Objective Optimization Algorithm Framework with Three Solution Sets
Authors:
Hisao Ishibuchi,
Lie Meng Pang,
Ke Shang
Abstract:
It is assumed in the evolutionary multi-objective optimization (EMO) community that a final solution is selected by a decision maker from a non-dominated solution set obtained by an EMO algorithm. The number of solutions to be presented to the decision maker can vary widely. In some cases, the decision maker may want to examine only a few representative solutions from which a final solution is selected. In other cases, a large number of non-dominated solutions may be needed to visualize the Pareto front. In this paper, we suggest the use of a general EMO framework with three solution sets to handle various situations with respect to the required number of solutions. The three solution sets are the main population of an EMO algorithm, an external archive to store promising solutions, and a final solution set which is presented to the decision maker. The final solution set is selected from the archive. Thus the population size and the archive size can be arbitrarily specified as long as the archive size is not smaller than the required number of solutions. The final population does not need to be a good solution set since it is not presented to the decision maker. Through computational experiments, we show the advantages of this framework over the standard final population and final archive frameworks. We also discuss how to select a final solution set and how to explain the reason for the selection, which is the first attempt towards an explainable EMO framework.
Submitted 14 December, 2020;
originally announced December 2020.
-
Decomposition-Based Multi-Objective Evolutionary Algorithm Design under Two Algorithm Frameworks
Authors:
Lie Meng Pang,
Hisao Ishibuchi,
Ke Shang
Abstract:
The development of efficient and effective evolutionary multi-objective optimization (EMO) algorithms has been an active research topic in the evolutionary computation community. Over the years, many EMO algorithms have been proposed. The existing EMO algorithms are mainly developed based on the final population framework. In the final population framework, the final population of an EMO algorithm is presented to the decision maker. Thus, it is required that the final population produced by an EMO algorithm is a good solution set. Recently, the use of the solution selection framework was suggested for the design of EMO algorithms. This framework has an unbounded external archive to store all the examined solutions. A pre-specified number of solutions are selected from the archive as the final solutions presented to the decision maker. When the solution selection framework is used, EMO algorithms can be designed in a more flexible manner since the final population does not need to be a good solution set. In this paper, we examine the design of MOEA/D under these two frameworks. We use an offline genetic algorithm-based hyper-heuristic method to find the optimal configuration of MOEA/D in each framework. The DTLZ and WFG test suites and their minus versions are used in our experiments. The experimental results suggest the possibility that a more flexible, robust and high-performance MOEA/D algorithm can be obtained when the solution selection framework is used.
Submitted 17 August, 2020;
originally announced August 2020.
-
Peristaltic pumping in thin, non-axisymmetric, annular tubes
Authors:
J. Brennen Carr,
John H. Thomas,
Jia Liu,
Jessica K. Shang
Abstract:
Two-dimensional laminar flow of a viscous fluid induced by peristalsis due to a moving wall wave has been studied previously for a rectangular channel, a circular tube, and a concentric circular annulus. Here we study peristaltic flow in a non-axisymmetric annular tube, where the flow is three dimensional, with azimuthal motions. This geometry is motivated by experimental observations of cerebrospinal fluid flow along perivascular spaces (PVSs) surrounding arteries in the brain, which is at least partially driven by peristaltic pumping. These PVSs are well matched, in cross-section, by an adjustable model consisting of an inner circle (arterial wall) and an outer ellipse (outer edge of the PVS), not necessarily concentric. We use this model, which may have other applications, as a basis for numerical simulations of peristaltic flow. We use a finite-element scheme to compute the flow driven by a propagating sinusoidal radial displacement of the inner wall. Unlike peristaltic flow in a concentric circular annulus, the flow is fully three-dimensional, with streamlines wiggling in both the radial and axial directions. We examine the dependence of the flow on the elongation of the outer elliptical wall and on the eccentricity of the configuration. We find that time-averaged volumetric flow decreases with increasing ellipticity or eccentricity. Azimuthal pressure variations, caused by the wall wave, drive an oscillatory azimuthal flow in and out of the narrower gaps. The additional shearing motion in the azimuthal direction will enhance Taylor dispersion in these flows, an effect that might have practical applications.
Submitted 29 July, 2020;
originally announced July 2020.
-
Algorithm Configurations of MOEA/D with an Unbounded External Archive
Authors:
Lie Meng Pang,
Hisao Ishibuchi,
Ke Shang
Abstract:
In the evolutionary multi-objective optimization (EMO) community, it is usually assumed that the final population is presented to the decision maker as the result of the execution of an EMO algorithm. Recently, an unbounded external archive was used to evaluate the performance of EMO algorithms in some studies where a pre-specified number of solutions are selected from all the examined non-dominated solutions. In this framework, which is referred to as the solution selection framework, the final population does not have to be a good solution set. Thus, the solution selection framework offers higher flexibility to the design of EMO algorithms than the final population framework. In this paper, we examine the design of MOEA/D under these two frameworks. First, we show that the performance of MOEA/D is improved by linearly changing the reference point specification during its execution through computational experiments with various combinations of initial and final specifications. Robust and high performance of the solution selection framework is observed. Then, we examine the use of a genetic algorithm-based offline hyper-heuristic method to find the best configuration of MOEA/D in each framework. Finally, we further discuss solution selection after the execution of an EMO algorithm in the solution selection framework.
Submitted 27 July, 2020;
originally announced July 2020.
-
Lazy Greedy Hypervolume Subset Selection from Large Candidate Solution Sets
Authors:
Weiyu Chen,
Hisao Ishibuchi,
Ke Shang
Abstract:
Subset selection has been a popular topic in recent years, and a number of subset selection methods have been proposed. Among those methods, hypervolume subset selection is widely used. Greedy hypervolume subset selection algorithms can achieve good approximations to the optimal subset. However, when the candidate set is large (e.g., an unbounded external archive with a large number of solutions), the algorithm is very time-consuming. In this paper, we propose a new lazy greedy algorithm exploiting the submodular property of the hypervolume indicator. The core idea is to avoid unnecessary hypervolume contribution calculations when finding the solution with the largest contribution. Experimental results show that the proposed algorithm is hundreds of times faster than the original greedy inclusion algorithm and several times faster than the fastest known greedy inclusion algorithm on many test problems.
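The lazy evaluation idea described in this abstract can be sketched with the generic lazy greedy scheme for submodular functions. This is not the authors' implementation: the two-objective hypervolume routine `hv2d`, the function `lazy_greedy`, and the example data are assumptions for illustration, and minimization of both objectives is assumed.

```python
import heapq
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def hv2d(points: Sequence[Point], ref: Point) -> float:
    """Two-objective hypervolume (minimization) bounded by the reference point."""
    area, best_f2 = 0.0, ref[1]
    for f1, f2 in sorted(points):
        if f1 < ref[0] and f2 < best_f2:
            area += (ref[0] - f1) * (best_f2 - f2)
            best_f2 = f2
    return area


def lazy_greedy(candidates: Sequence[Point], k: int, ref: Point) -> List[Point]:
    """Greedy inclusion of k points, recomputing contributions only when needed."""
    selected: List[Point] = []
    # Heap entries: (-contribution bound, candidate index, round the bound was computed in).
    heap = [(-hv2d([p], ref), i, 0) for i, p in enumerate(candidates)]
    heapq.heapify(heap)
    while heap and len(selected) < k:
        neg_gain, i, stamp = heapq.heappop(heap)
        if stamp == len(selected):
            # The bound is exact for the current round. By submodularity, the stale
            # bounds left in the heap can only overestimate, so this one is the largest.
            selected.append(candidates[i])
        else:
            # Stale bound: recompute the true contribution and reinsert it.
            gain = hv2d(selected + [candidates[i]], ref) - hv2d(selected, ref)
            heapq.heappush(heap, (-gain, i, len(selected)))
    return selected


# Hypothetical candidate set and reference point.
candidates = [(1, 5), (2, 3), (3, 2.5), (4, 1), (2.5, 2.8)]
print(lazy_greedy(candidates, 3, (6, 6)))
```

Candidates whose stale bound is already below the best fresh contribution are never re-evaluated in that round, which is where the saving over plain greedy inclusion comes from.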
Submitted 4 July, 2020;
originally announced July 2020.
-
Solution Subset Selection for Final Decision Making in Evolutionary Multi-Objective Optimization
Authors:
Hisao Ishibuchi,
Lie Meng Pang,
Ke Shang
Abstract:
In general, a multi-objective optimization problem does not have a single optimal solution but a set of Pareto optimal solutions, which forms the Pareto front in the objective space. Various evolutionary algorithms have been proposed to approximate the Pareto front using a pre-specified number of solutions. Hundreds of solutions are obtained by a single run of such an algorithm. The selection of a single final solution from the obtained solutions is assumed to be done by a human decision maker. However, in many cases, the decision maker does not want to examine hundreds of solutions. Thus, a small subset of the obtained solutions needs to be selected. In this paper, we discuss subset selection from the viewpoint of final decision making. First, we briefly explain existing subset selection studies. Next, we formulate an expected loss function for subset selection. We also show that the formulated function is the same as the IGD plus indicator. Then, we report experimental results where the proposed approach is compared with other indicator-based subset selection methods.
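For reference, the IGD plus indicator named in this abstract can be computed as follows under its standard definition for minimization problems. The function name `igd_plus` and the example point sets are assumptions for illustration; the expected loss formulation of the paper itself is not reproduced here.

```python
import math
from typing import Sequence

Vector = Sequence[float]


def igd_plus(solutions: Sequence[Vector], reference: Sequence[Vector]) -> float:
    """Average over reference points of the smallest d+ distance to the solution set."""

    def d_plus(a: Vector, z: Vector) -> float:
        # Only the objectives in which solution `a` is worse than reference point `z` count.
        return math.sqrt(sum(max(ai - zi, 0.0) ** 2 for ai, zi in zip(a, z)))

    return sum(min(d_plus(a, z) for a in solutions) for z in reference) / len(reference)


# Hypothetical reference set on the front f1 + f2 = 1 and a one-point subset.
reference = [(0.0, 1.0), (0.5, 0.5), (1.0, 0.0)]
print(igd_plus([(0.5, 0.5)], reference))
```

A smaller value indicates that the selected subset represents the reference set better, so a subset selection method would try to minimize it.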
Submitted 15 June, 2020;
originally announced June 2020.
-
Effects of Discretization of Decision and Objective Spaces on the Performance of Evolutionary Multiobjective Optimization Algorithms
Authors:
Weiyu Chen,
Hisao Ishibuchi,
Ke Shang
Abstract:
Recently, the discretization of decision and objective spaces has been discussed in the literature. In some studies, it is shown that the decision space discretization improves the performance of evolutionary multi-objective optimization (EMO) algorithms on continuous multi-objective test problems. In other studies, it is shown that the objective space discretization improves the performance on combinatorial multi-objective problems. However, the effect of the simultaneous discretization of both spaces has not been examined in the literature. In this paper, we examine the effects of the decision space discretization, objective space discretization and simultaneous discretization on the performance of NSGA-II through computational experiments on the DTLZ and WFG problems. Using various settings about the number of decision variables and the number of objectives, our experiments are performed on four types of problems: standard problems, large-scale problems, many-objective problems, and large-scale many-objective problems. We show that the decision space discretization has a positive effect for large-scale problems and the objective space discretization has a positive effect for many-objective problems. We also show the discretization of both spaces is useful for large-scale many-objective problems.
Submitted 22 March, 2020;
originally announced March 2020.
-
Building and Testing Yield Curve Generators for P&C Insurance
Authors:
Gary Venter,
Kailan Shang
Abstract:
Interest-rate risk is a key factor for property-casualty insurer capital. P&C companies tend to be highly leveraged, with bond holdings much greater than capital. For GAAP capital, bonds are marked to market but liabilities are not, so shifts in the yield curve can have a significant impact on capital. Yield-curve scenario generators are one approach to quantifying this risk. They produce many future simulated evolutions of the yield curve, which can be used to quantify the probabilities of bond-value changes that would result from various maturity-mix strategies. Some of these generators are provided as black-box models where the user gets only the projected scenarios. One focus of this paper is to provide methods for testing generated scenarios from such models by comparing to known distributional properties of yield curves.
P&C insurers hold bonds to maturity and manage cash-flow risk by matching asset and liability flows. Derivative pricing and stochastic volatility are of little concern over the relevant time frames. This requires different models and model testing than what is common in the broader financial markets.
To complicate things further, interest rates for the last decade have not been following the patterns established in the sixty years following WWII. We are now coming out of the period of very low rates, yet are still not returning to what had been thought of as normal before that. Modeling and model testing are in an evolving state while new patterns emerge.
Our analysis starts with a review of the literature on interest-rate model testing, with a P&C focus, and an update of the tests for current market behavior. We then discuss models, and use them to illustrate the fitting and testing methods. The testing discussion does not require the model-building section.
Submitted 22 December, 2019;
originally announced December 2019.
-
A novel metric for community detection
Authors:
Ke-ke Shang,
Michael Small,
Yan Wang,
Di Yin,
Shu Li
Abstract:
Research into the detection of dense communities has recently attracted increasing attention within network science, and various metrics for detecting such communities have been proposed. The most popular metric -- Modularity -- is based on the so-called rule that the links within communities are denser than external links among communities, and it has become the default. However, this default metric suffers from ambiguity and, worse, all augmentations of modularity are based on a narrow intuition of what it means to form a "community". We argue that in specific, but quite common systems, links within a community are not necessarily more common than links between communities. Instead, we propose that the defining characteristic of a community is that links are more predictable within a community than between communities. In this paper, based on the effect of communities on link prediction, we propose a novel metric for community detection based directly on this feature. We find that our metric is more robust than traditional modularity. Consequently, we can evaluate algorithm stability for the same detection algorithm in different networks. Our metric can also directly uncover false community detection and infer more statistical characteristics for detection algorithms.
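To make the baseline concrete, the sketch below computes the traditional (Newman) modularity that this abstract contrasts against, for an undirected, unweighted graph. It does not implement the link-prediction-based metric proposed in the paper; the function name `modularity` and the toy graph are assumptions for illustration.

```python
from collections import defaultdict
from typing import Dict, Hashable, Sequence, Tuple

Edge = Tuple[Hashable, Hashable]


def modularity(edges: Sequence[Edge], community: Dict[Hashable, Hashable]) -> float:
    """Q = sum over communities c of (L_c / m) - (d_c / (2m))^2."""
    m = len(edges)
    internal = defaultdict(int)  # L_c: number of edges with both ends in community c
    degree = defaultdict(int)    # d_c: total degree of the nodes in community c
    for u, v in edges:
        degree[community[u]] += 1
        degree[community[v]] += 1
        if community[u] == community[v]:
            internal[community[u]] += 1
    return sum(internal[c] / m - (degree[c] / (2 * m)) ** 2 for c in degree)


# Hypothetical toy graph: two triangles joined by a single edge.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
two_groups = {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"}
print(modularity(edges, two_groups))
```

The score rewards partitions whose communities contain more internal links than expected from node degrees alone, which is the density-based intuition the abstract argues is too narrow.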
Submitted 26 September, 2019;
originally announced September 2019.
-
The key to the weak-ties phenomenon
Authors:
Ke-ke Shang,
Michael Small,
Di Yin,
Yan Wang,
Tong-chen Li
Abstract:
The study of the weak-ties phenomenon has a long and well-documented history, and research into the application of this social phenomenon has recently attracted increasing attention. However, further exploration of the reasons behind the weak-ties phenomenon is still challenging. Fortunately, data-driven network science provides a novel way with substantial explanatory power to analyze the causal mechanism behind social phenomena. Inspired by this perspective, we propose an approach to further explore the driving factors behind the temporal weak-ties phenomenon. We find that the obvious intuition underlying the weak-ties phenomenon is incorrect, and that the often large number of unknown mutual friends associated with these weak ties is one of the key reasons for the emergence of the weak-ties phenomenon. In particular, scientific collaborators with weak ties, for example, prefer to be involved in direct collaboration rather than share ideas with mutual colleagues -- there is a natural tendency to collapse short strong chains of connection.
Submitted 9 June, 2019;
originally announced June 2019.
-
Robust Principal Component Analysis for Modal Decomposition of Corrupt Fluid Flows
Authors:
Isabel Scherl,
Benjamin Strom,
Jessica K. Shang,
Owen Williams,
Brian L. Polagye,
Steven L. Brunton
Abstract:
Modal analysis techniques are used to identify patterns and develop reduced-order models in a variety of fluid applications. However, experimentally acquired flow fields may be corrupted with incorrect and missing entries, which may degrade modal decomposition. Here we use robust principal component analysis (RPCA) to improve the quality of flow field data by leveraging global coherent structures to identify and replace spurious data points. RPCA is a robust variant of principal component analysis (PCA), also known as proper orthogonal decomposition (POD) in fluids, that decomposes a data matrix into the sum of a low-rank matrix containing coherent structures and a sparse matrix of outliers and corrupt entries. We apply RPCA filtering to a range of fluid simulations and experiments of varying complexities and assess the accuracy of low-rank structure recovery. First, we analyze direct numerical simulations of flow past a circular cylinder at Reynolds number 100 with artificial outliers, alongside similar PIV measurements at Reynolds number 413. Next, we apply RPCA filtering to a turbulent channel flow simulation from the Johns Hopkins Turbulence database, demonstrating that dominant coherent structures are preserved in the low-rank matrix. Finally, we investigate PIV measurements behind a two-bladed cross-flow turbine that exhibits both broadband and coherent phenomena. In all cases, we find that RPCA filtering extracts dominant coherent structures and identifies and fills in incorrect or missing measurements. The performance is particularly striking when flow fields are analyzed using dynamic mode decomposition, which is sensitive to noise and outliers.
Submitted 13 December, 2019; v1 submitted 16 May, 2019;
originally announced May 2019.
-
Label-Removed Generative Adversarial Networks Incorporating with K-Means
Authors:
Ce Wang,
Zhangling Chen,
Kun Shang
Abstract:
Generative Adversarial Networks (GANs) have achieved great success in generating realistic images. Most of these are conditional models, although acquisition of class labels is expensive and time-consuming in practice. To reduce the dependence on labeled data, we propose an un-conditional generative adversarial model, called K-Means-GAN (KM-GAN), which incorporates the idea of updating centers in K-Means into GANs. Specifically, we redesign the framework of GANs by applying K-Means on the features extracted from the discriminator. With obtained labels from K-Means, we propose new objective functions from the perspective of deep metric learning (DML). Distinct from previous works, the discriminator is treated as a feature extractor rather than a classifier in KM-GAN, meanwhile utilization of K-Means makes features of the discriminator more representative. Experiments are conducted on various datasets, such as MNIST, Fashion-10, CIFAR-10 and CelebA, and show that the quality of samples generated by KM-GAN is comparable to some conditional generative adversarial models.
Submitted 19 February, 2019;
originally announced February 2019.
-
A Light-weight Vibrational Motor Powered Recoil Robot that Hops Rapidly Across Granular Media
Authors:
Alice C. Quillen,
Randal C. Nelson,
Hesam Askari,
Kathryn Chotkowski,
Esteban Wright,
Jessica K. Shang
Abstract:
A 1 cm coin vibrational motor fixed to the center of a 4 cm square foam platform moves rapidly across granular media (poppy seeds, millet, corn meal) at a speed of up to 30 cm/s, or about 5 body lengths/s. Fast speeds are achieved with dimensionless acceleration number, similar to a Froude number, up to 50, allowing the light-weight 1.4 g mechanism to remain above the substrate, levitated and propelled by its kicks off the surface. The mechanism is low cost and moves without any external moving parts. With 2 s exposures we photograph the trajectory of the mechanism using an LED blocked except for a pin-hole and fixed to the mechanism. Trajectories can exhibit period doubling phenomena similar to a ball bouncing on a vibrating table top. A two dimensional numerical model gives similar trajectories, though a vertical drag force is required to keep the mechanism height low. We attribute the vertical drag force to aerodynamic suction from air flow below the mechanism base and through the granular substrate. Our numerical model suggests that speed is maximized when the mechanism is prevented from jumping high off the surface. In this way the mechanism resembles a galloping or jumping animal whose body remains nearly at the same height above the ground during its gait.
Submitted 23 September, 2018;
originally announced October 2018.
-
R2-based Hypervolume Contribution Approximation
Authors:
Ke Shang,
Hisao Ishibuchi,
Xizi Ni
Abstract:
In this letter, a new hypervolume contribution approximation method is proposed which is formulated as an R2 indicator. The basic idea of the proposed method is to use different line segments only in the hypervolume contribution region for the hypervolume contribution approximation. Compared with a traditional method, which is based on the R2 indicator to approximate the hypervolume, the new method can directly approximate the hypervolume contribution and utilizes all the direction vectors only in the hypervolume contribution region. The new method, the traditional method and the Monte Carlo sampling method together with two exact methods are compared through comprehensive experiments. Our results show the advantages of the new method over the other methods. Compared with the other two approximation methods, the new method achieves the best performance for comparing hypervolume contributions of different solutions and identifying the solution with the smallest hypervolume contribution. Compared with the exact methods, the new method is computationally efficient in high-dimensional spaces where the exact methods are impractical to use.
Submitted 27 February, 2019; v1 submitted 15 May, 2018;
originally announced May 2018.
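Illustrative sketch (not from the letter): the abstract above compares the proposed R2-based method against a Monte Carlo sampling method. The code below is a minimal Monte Carlo estimate of a hypervolume contribution only, not the R2-based method itself. It assumes minimization and a user-chosen reference point; the function names `dominates` and `mc_hv_contribution`, the sample count, and the example front are all hypothetical choices made for illustration.

```python
import random


def dominates(a, p):
    """Return True if solution a weakly dominates point p (minimization assumed)."""
    return all(ai <= pi for ai, pi in zip(a, p))


def mc_hv_contribution(s, others, ref, n_samples=20000, seed=0):
    """Estimate the hypervolume contribution of s by uniform sampling.

    The region dominated by s and bounded by the reference point is the
    box [s, ref]. The contribution of s is the part of that box that no
    other solution dominates.
    """
    rng = random.Random(seed)

    # Volume of the box spanned by s and the reference point.
    box_volume = 1.0
    for si, ri in zip(s, ref):
        box_volume *= max(ri - si, 0.0)
    if box_volume == 0.0:
        return 0.0

    # Count samples in the box that only s dominates.
    exclusive = 0
    for _ in range(n_samples):
        p = [rng.uniform(si, ri) for si, ri in zip(s, ref)]
        if not any(dominates(a, p) for a in others):
            exclusive += 1

    return box_volume * exclusive / n_samples


# Hypothetical 2-D front and reference point (assumptions for illustration).
front = [(1.0, 3.0), (2.0, 2.0), (3.0, 1.0)]
ref = (4.0, 4.0)
contribs = [
    mc_hv_contribution(s, front[:i] + front[i + 1:], ref)
    for i, s in enumerate(front)
]
print(contribs)
```

For this small front the exact contribution of each solution is 1 by direct area computation, so the estimates should be close to 1. The cost of such sampling grows with the number of samples and solutions, which is the kind of baseline the abstract contrasts with its R2-based approximation.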
-
Integrated cladding-pumped multi-core, few-mode erbium-doped fibre amplifier for space-division multiplexed communications
Authors:
H. Chen,
C. Jin,
B. Huang,
N. K. Fontaine,
R. Ryf,
K. Shang,
N. Grégoire,
S. Morency,
R. -J. Essiambre,
G. Li,
Y. Messaddeq,
S. LaRochelle
Abstract:
Space-division multiplexing (SDM), whereby multiple spatial channels in multimode and multicore optical fibers are used to increase the total transmission capacity per fiber, is being investigated to avert a data capacity crunch and reduce the cost per transmitted bit. With the number of channels employed in SDM transmission experiments continuing to rise, there is a requirement for integrated SDM components that are scalable. Here, we demonstrate a cladding-pumped SDM erbium-doped fiber amplifier (EDFA) that consists of six uncoupled multimode erbium-doped cores. Each core supports three spatial modes, which enables the EDFA to amplify a total of 18 spatial channels simultaneously with a single pump diode and a complexity similar to a single-mode EDFA. The amplifier delivers more than 20-dBm total output power per core and less than 7-dB noise figure over the C-band. This cladding-pumped EDFA enables combined space-division and wavelength-division multiplexed transmission over multiple multimode fiber spans.
Submitted 19 March, 2017;
originally announced March 2017.
-
Two-stage robust optimization for orienteering problem with stochastic weights
Authors:
Ke Shang,
Felix T. S. Chan,
Stephen Karungaru,
Kenji Terada,
Zuren Feng,
Liangjun Ke
Abstract:
In this paper, the two-stage orienteering problem with stochastic weights (OPSW) is considered, where the first-stage problem is to plan a path under an uncertain environment and the second-stage problem is a recourse action that ensures the length constraint is satisfied after the uncertainty is realized. Two recourse models are introduced, based on two different ways in which the uncertainty is realized: one is based on sequentially realized weights, which leads to the recourse model proposed by Evers et al. (2014), and the other is based on concurrently realized weights, which leads to a new recourse model that has fewer variables and fewer constraints and is computationally more efficient. Subsequently, two two-stage robust models are introduced for OPSW based on the two recourse models, and the relationships between the two-stage robust models and their corresponding static robust models are investigated. Theoretical conclusions are drawn which show that, with the box uncertainty set defined, the two-stage robust models are equivalent to their corresponding static robust models, and that the two two-stage robust models are also equivalent to each other even though they are based on different recourse models. A case study is presented that compares the two-stage robust models with a one-stage robust model for OPSW. The numerical results of the comparative studies show the effectiveness and superiority of the proposed two-stage robust models for dealing with the two-stage OPSW.
Submitted 13 April, 2017; v1 submitted 31 December, 2016;
originally announced January 2017.