-
Byzantine Outside, Curious Inside: Reconstructing Data Through Malicious Updates
Authors:
Kai Yue,
Richeng Jin,
Chau-Wai Wong,
Huaiyu Dai
Abstract:
Federated learning (FL) enables decentralized machine learning without sharing raw data, allowing multiple clients to collaboratively learn a global model. However, studies reveal that privacy leakage is possible under commonly adopted FL protocols. In particular, a server with access to client gradients can synthesize data resembling the clients' training data. In this paper, we introduce a novel…
▽ More
Federated learning (FL) enables decentralized machine learning without sharing raw data, allowing multiple clients to collaboratively learn a global model. However, studies reveal that privacy leakage is possible under commonly adopted FL protocols. In particular, a server with access to client gradients can synthesize data resembling the clients' training data. In this paper, we introduce a novel threat model in FL, named the maliciously curious client, where a client manipulates its own gradients with the goal of inferring private data from peers. This attacker uniquely exploits the strength of a Byzantine adversary, traditionally aimed at undermining model robustness, and repurposes it to facilitate data reconstruction attack. We begin by formally defining this novel client-side threat model and providing a theoretical analysis that demonstrates its ability to achieve significant reconstruction success during FL training. To demonstrate its practical impact, we further develop a reconstruction algorithm that combines gradient inversion with malicious update strategies. Our analysis and experimental results reveal a critical blind spot in FL defenses: both server-side robust aggregation and client-side privacy mechanisms may fail against our proposed attack. Surprisingly, standard server- and client-side defenses designed to enhance robustness or privacy may unintentionally amplify data leakage. Compared to the baseline approach, a mistakenly used defense may instead improve the reconstructed image quality by 10-15%.
△ Less
Submitted 12 June, 2025;
originally announced June 2025.
-
Zero-Shot Vision Encoder Grafting via LLM Surrogates
Authors:
Kaiyu Yue,
Vasu Singla,
Menglin Jia,
John Kirchenbauer,
Rifaa Qadri,
Zikui Cai,
Abhinav Bhatele,
Furong Huang,
Tom Goldstein
Abstract:
Vision language models (VLMs) typically pair a modestly sized vision encoder with a large language model (LLM), e.g., Llama-70B, making the decoder the primary computational burden during training. To reduce costs, a potential promising strategy is to first train the vision encoder using a small language model before transferring it to the large one. We construct small "surrogate models" that shar…
▽ More
Vision language models (VLMs) typically pair a modestly sized vision encoder with a large language model (LLM), e.g., Llama-70B, making the decoder the primary computational burden during training. To reduce costs, a potential promising strategy is to first train the vision encoder using a small language model before transferring it to the large one. We construct small "surrogate models" that share the same embedding space and representation language as the large target LLM by directly inheriting its shallow layers. Vision encoders trained on the surrogate can then be directly transferred to the larger model, a process we call zero-shot grafting -- when plugged directly into the full-size target LLM, the grafted pair surpasses the encoder-surrogate pair and, on some benchmarks, even performs on par with full decoder training with the target LLM. Furthermore, our surrogate training approach reduces overall VLM training costs by ~45% when using Llama-70B as the decoder.
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
Deep Learning-Based Wideband Spectrum Sensing with Dual-Representation Inputs and Subband Shuffling Augmentation
Authors:
Shilian Zheng,
Zhihao Ye,
Luxin Zhang,
Keqiang Yue,
Zhijin Zhao
Abstract:
The widespread adoption of mobile communication technology has led to a severe shortage of spectrum resources, driving the development of cognitive radio technologies aimed at improving spectrum utilization, with spectrum sensing being the key enabler. This paper presents a novel deep learning-based wideband spectrum sensing framework that leverages multi-taper power spectral inputs to achieve hig…
▽ More
The widespread adoption of mobile communication technology has led to a severe shortage of spectrum resources, driving the development of cognitive radio technologies aimed at improving spectrum utilization, with spectrum sensing being the key enabler. This paper presents a novel deep learning-based wideband spectrum sensing framework that leverages multi-taper power spectral inputs to achieve high-precision and sample-efficient sensing. To enhance sensing accuracy, we incorporate a feature fusion strategy that combines multiple power spectrum representations. To tackle the challenge of limited sample sizes, we propose two data augmentation techniques designed to expand the training set and improve the network's detection probability. Comprehensive simulation results demonstrate that our method outperforms existing approaches, particularly in low signal-to-noise ratio conditions, achieving higher detection probabilities and lower false alarm rates. The method also exhibits strong robustness across various scenarios, highlighting its significant potential for practical applications in wireless communication systems.
△ Less
Submitted 9 April, 2025;
originally announced April 2025.
-
Investigation of Direct Nuclear Reactions in a Storage Ring Using In-Ring Detection
Authors:
J. C. Zamora,
T. Aumann,
S. Bagchi,
S. Bishop,
M. Bo,
S. Bonig,
C. Brandau,
M. Csatlos,
T. Davinson,
I. Dillmann,
C. Dimopoulou,
D. T. Doherty,
P. Egelhof,
V. Eremin,
A. Estrade,
A. Evdokimovc,
J. L. Ferreira,
T. Furuno,
H. Geissel,
R. Gernhauser,
A. Gumberidze,
M. N. Harakeh,
A. -L. Hartig,
M. Heil,
S. Ilieva
, et al. (40 additional authors not shown)
Abstract:
\textbf{Background:} Experiments involving nuclear reactions in a storage ring offer exceptional possibilities for precise measurements in inverse kinematics. These experiments provide excellent angular and energy resolution by particle spectroscopy, in addition to high luminosities. However, the extremely low-pressure environment maintained in the storage rings poses significant difficulties for…
▽ More
\textbf{Background:} Experiments involving nuclear reactions in a storage ring offer exceptional possibilities for precise measurements in inverse kinematics. These experiments provide excellent angular and energy resolution by particle spectroscopy, in addition to high luminosities. However, the extremely low-pressure environment maintained in the storage rings poses significant difficulties for experiments employing detectors or any outgassing material in the ring.
\textbf{Purpose:} To investigate nuclear reactions in inverse kinematics using the storage-ring technique. The reactions were induced by scattering of a ${}^{20}\mathrm{Ne}$ beam off a hydrogen target at an energy of 50~MeV/u.
\textbf{Method:} A beam of fully stripped ${}^{20}$Ne ions was injected into the ESR storage ring at an energy of 50 MeV/u. The beam interacted with an internal hydrogen gas-jet target. An ultra-high vacuum compatible detector setup was installed around the gas jet inside the ring to measure the recoiling particles generated by nuclear reactions.
\textbf{Results:} Multiple reaction channels were observed during the experiment. In particular, we present the results from studies on elastic and inelastic scattering, as well as the neutron transfer reaction ${}^{20}\mathrm{Ne}(p,d){}^{19}\mathrm{Ne}^*$. The experimental data were compared to calculations that took into account the most significant excited states, using a coupled-reaction channel approach. A very good agreement with the experimental data was achieved.
\textbf{Conclusions:} The present results are the first demonstration of the investigation transfer reactions using detectors directly installed in the ring. This provides an important proof-of-principle for prospective studies with far-from-stability radioactive beams in the future.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
Dark Photon Polarimetry
Authors:
Fiesta Ting Yan Leung,
Tao Liu,
Sida Lu,
Jing Ren,
Cheuk Kan Kelvin Yue,
Kaifeng Zheng
Abstract:
We propose detecting dark photon (DP), a major candidate for wave dark matter, through polarimetry. The DP can modify Maxwell's equations, due to its kinetic mixing with regular photon, inducing an oscillating component in the electromagnetic field. This may leave an imprint in polarimetric light signals, characterised by a distinctive wave pattern in spacetime. As a demonstration, we apply this m…
▽ More
We propose detecting dark photon (DP), a major candidate for wave dark matter, through polarimetry. The DP can modify Maxwell's equations, due to its kinetic mixing with regular photon, inducing an oscillating component in the electromagnetic field. This may leave an imprint in polarimetric light signals, characterised by a distinctive wave pattern in spacetime. As a demonstration, we apply this methodology to investigate ultralight DP produced through the superradiance of supermassive black holes. Then using the polarimetric measurements of the radiation from M87$^\ast$ at the Event Horizon Telescope, we show that all Stokes parameters can serve as a probe in conducting this task. Especially, the absence of significant temporal variation in the linear-polarisation position angle of the M87$^\ast$ images allows us to set novel limits on the photon-DP mixing parameter over the rarely-explored DP mass range of $10^{-22}$--$10^{-20}$eV, with the best reach of $\sim 10^{-8}$ achieved at $\sim 10^{-20.2}$eV. Given the universality of its underlying physics, we expect the DP polarimetry to be broadly applied for the DP detection in laboratory experiments and astronomical observations.
△ Less
Submitted 23 January, 2025;
originally announced January 2025.
-
Progress of the anti-obesity of Berberine
Authors:
Kong Yue,
Yang haokun,
Nie Rong,
Zhang Xuxiang,
Zhang Hongtao,
Nian Xin
Abstract:
Obesity is defined as the excessive accumulation or abnormal distribution of body fat. According to data from World Obesity Atlas 2024, the increase in prevalence of obesity has become a major worldwide health problem in adults as well as among children and adolescents. Although an increasing number of drugs have been approved for the treatment of obesity in recent years, many of these drugs have…
▽ More
Obesity is defined as the excessive accumulation or abnormal distribution of body fat. According to data from World Obesity Atlas 2024, the increase in prevalence of obesity has become a major worldwide health problem in adults as well as among children and adolescents. Although an increasing number of drugs have been approved for the treatment of obesity in recent years, many of these drugs have inevitable side effects which have increased the demand for new safe, accessible and effective drugs for obesity and prompt interest in natural products. Berberine (BBR) and its metabolites, known for their multiple pharmacological effects. Recent studies have emphatically highlighted the anti-obesity benefits of BBR and the underlying mechanisms have been gradually elucidated. However, its clinical application is limited by poor oral absorption and low bioavailability. Based on this, this review summarizes current research on the anti-obesity effects of BBR and its metabolites, including advancements in clinical trail results, understanding potential molecular mechanisms and absorption and bioavailability. As a natural compound derived from plants, BBR holds potential as an alternative approach for managing obesity.
△ Less
Submitted 4 January, 2025;
originally announced January 2025.
-
SituFont: A Just-in-Time Adaptive Intervention System for Enhancing Mobile Readability in Situational Visual Impairments
Authors:
Kun Yue,
Mingshan Zhang,
Jingruo Chen,
Chun Yu,
Kexin Nie,
Zhiqi Gao,
Jinghan Yang,
Chen Liang,
Yuanchun Shi
Abstract:
Situational visual impairments (SVIs) significantly impact mobile readability, causing user discomfort and hindering information access. This paper introduces SituFont, a novel just-in-time adaptive intervention (JITAI) system designed to enhance mobile text readability by semi-automatically adjusting font parameters in response to real-time contextual changes. Leveraging smartphone sensors and a…
▽ More
Situational visual impairments (SVIs) significantly impact mobile readability, causing user discomfort and hindering information access. This paper introduces SituFont, a novel just-in-time adaptive intervention (JITAI) system designed to enhance mobile text readability by semi-automatically adjusting font parameters in response to real-time contextual changes. Leveraging smartphone sensors and a human-in-the-loop approach, SituFont personalizes the reading experience by adapting to individual user preferences, including personal factors such as fatigue and distraction level, and environmental factors like lighting, motion, and location. To inform the design of SituFont, we conducted formative interviews (N=15) to identify key SVI factors affecting readability and controlled experiments (N=18) to quantify the relationship between these factors and optimal text parameters. We then evaluated SituFont's effectiveness through a comparative user study under eight simulated SVI scenarios (N=12), demonstrating its ability to overcome SVIs. Our findings highlight the potential of JITAI systems like SituFont to mitigate the impact of SVIs and enhance mobile accessibility.
△ Less
Submitted 12 October, 2024;
originally announced October 2024.
-
Federated Learning Nodes Can Reconstruct Peers' Image Data
Authors:
Ethan Wilson,
Kai Yue,
Chau-Wai Wong,
Huaiyu Dai
Abstract:
Federated learning (FL) is a privacy-preserving machine learning framework that enables multiple nodes to train models on their local data and periodically average weight updates to benefit from other nodes' training. Each node's goal is to collaborate with other nodes to improve the model's performance while keeping its training data private. However, this framework does not guarantee data privac…
▽ More
Federated learning (FL) is a privacy-preserving machine learning framework that enables multiple nodes to train models on their local data and periodically average weight updates to benefit from other nodes' training. Each node's goal is to collaborate with other nodes to improve the model's performance while keeping its training data private. However, this framework does not guarantee data privacy. Prior work has shown that the gradient-sharing steps in FL can be vulnerable to data reconstruction attacks from an honest-but-curious central server. In this work, we show that an honest-but-curious node/client can also launch attacks to reconstruct peers' image data through gradient inversion, presenting a severe privacy risk. We demonstrate that a single client can silently reconstruct other clients' private images using diluted information available within consecutive updates. We leverage state-of-the-art diffusion models to enhance the perceptual quality and recognizability of the reconstructed images, further demonstrating the risk of information leakage at a semantic level. This highlights the need for more robust privacy-preserving mechanisms that protect against silent client-side attacks during federated training.
△ Less
Submitted 12 June, 2025; v1 submitted 6 October, 2024;
originally announced October 2024.
-
NTK-DFL: Enhancing Decentralized Federated Learning in Heterogeneous Settings via Neural Tangent Kernel
Authors:
Gabriel Thompson,
Kai Yue,
Chau-Wai Wong,
Huaiyu Dai
Abstract:
Decentralized federated learning (DFL) is a collaborative machine learning framework for training a model across participants without a central server or raw data exchange. DFL faces challenges due to statistical heterogeneity, as participants often possess data of different distributions reflecting local environments and user behaviors. Recent work has shown that the neural tangent kernel (NTK) a…
▽ More
Decentralized federated learning (DFL) is a collaborative machine learning framework for training a model across participants without a central server or raw data exchange. DFL faces challenges due to statistical heterogeneity, as participants often possess data of different distributions reflecting local environments and user behaviors. Recent work has shown that the neural tangent kernel (NTK) approach, when applied to federated learning in a centralized framework, can lead to improved performance. We propose an approach leveraging the NTK to train client models in the decentralized setting, while introducing a synergy between NTK-based evolution and model averaging. This synergy exploits inter-client model deviation and improves both accuracy and convergence in heterogeneous settings. Empirical results demonstrate that our approach consistently achieves higher accuracy than baselines in highly heterogeneous settings, where other approaches often underperform. Additionally, it reaches target performance in 4.6 times fewer communication rounds. We validate our approach across multiple datasets, network topologies, and heterogeneity settings to ensure robustness and generalization.
△ Less
Submitted 12 June, 2025; v1 submitted 2 October, 2024;
originally announced October 2024.
-
Advancing Hybrid Defense for Byzantine Attacks in Federated Learning
Authors:
Kai Yue,
Richeng Jin,
Chau-Wai Wong,
Huaiyu Dai
Abstract:
Federated learning (FL) enables multiple clients to collaboratively train a global model without sharing their local data. Recent studies have highlighted the vulnerability of FL to Byzantine attacks, where malicious clients send poisoned updates to degrade model performance. In particular, many attacks have been developed targeting specific aggregation rules, whereas various defense mechanisms ha…
▽ More
Federated learning (FL) enables multiple clients to collaboratively train a global model without sharing their local data. Recent studies have highlighted the vulnerability of FL to Byzantine attacks, where malicious clients send poisoned updates to degrade model performance. In particular, many attacks have been developed targeting specific aggregation rules, whereas various defense mechanisms have been designed for dedicated threat models. This paper studies the resilience of attack-agnostic FL scenarios, where the server lacks prior knowledge of both the attackers' strategies and the number of malicious clients involved. We first introduce hybrid defenses against state-of-the-art attacks. Our goal is to identify a general-purpose aggregation rule that performs well on average while also avoiding worst-case vulnerabilities. By adaptively selecting from available defenses, we demonstrate that the server remains robust even when confronted with a substantial proportion of poisoned updates. We also emphasize that existing FL defenses should not automatically be regarded as secure, as demonstrated by the newly proposed Trapsetter attack. The proposed attack outperforms other state-of-the-art attacks by further increasing the impact of the attack by 5-15%. Our findings highlight the ongoing need for the development of Byzantine-resilient aggregation algorithms in FL.
△ Less
Submitted 12 June, 2025; v1 submitted 10 September, 2024;
originally announced September 2024.
-
AngleSizer: Enhancing Spatial Scale Perception for the Visually Impaired with an Interactive Smartphone Assistant
Authors:
Xiaoqing Jing,
Chun Yu,
Kun Yue,
Liangyou Lu,
Nan Gao,
Weinan Shi,
Mingshan Zhang,
Ruolin Wang,
Yuanchun Shi
Abstract:
Spatial perception, particularly at small and medium scales, is an essential human sense but poses a significant challenge for the blind and visually impaired (BVI). Traditional learning methods for BVI individuals are often constrained by the limited availability of suitable learning environments and high associated costs. To tackle these barriers, we conducted comprehensive studies to delve into…
▽ More
Spatial perception, particularly at small and medium scales, is an essential human sense but poses a significant challenge for the blind and visually impaired (BVI). Traditional learning methods for BVI individuals are often constrained by the limited availability of suitable learning environments and high associated costs. To tackle these barriers, we conducted comprehensive studies to delve into the real-world challenges faced by the BVI community. We have identified several key factors hindering their spatial perception, including the high social cost of seeking assistance, inefficient methods of information intake, cognitive and behavioral disconnects, and a lack of opportunities for hands-on exploration. As a result, we developed AngleSizer, an innovative teaching assistant that leverages smartphone technology. AngleSizer is designed to enable BVI individuals to use natural interaction gestures to try, feel, understand, and learn about sizes and angles effectively. This tool incorporates dual vibration-audio feedback, carefully crafted teaching processes, and specialized learning modules to enhance the learning experience. Extensive user experiments validated its efficacy and applicability with diverse abilities and visual conditions. Ultimately, our research not only expands the understanding of BVI behavioral patterns but also greatly improves their spatial perception capabilities, in a way that is both cost-effective and allows for independent learning.
△ Less
Submitted 24 August, 2024;
originally announced August 2024.
-
From Pixels to Prose: A Large Dataset of Dense Image Captions
Authors:
Vasu Singla,
Kaiyu Yue,
Sukriti Paul,
Reza Shirkavand,
Mayuka Jayawardhana,
Alireza Ganjdanesh,
Heng Huang,
Abhinav Bhatele,
Gowthami Somepalli,
Tom Goldstein
Abstract:
Training large vision-language models requires extensive, high-quality image-text pairs. Existing web-scraped datasets, however, are noisy and lack detailed image descriptions. To bridge this gap, we introduce PixelProse, a comprehensive dataset of over 16M (million) synthetically generated captions, leveraging cutting-edge vision-language models for detailed and accurate descriptions. To ensure d…
▽ More
Training large vision-language models requires extensive, high-quality image-text pairs. Existing web-scraped datasets, however, are noisy and lack detailed image descriptions. To bridge this gap, we introduce PixelProse, a comprehensive dataset of over 16M (million) synthetically generated captions, leveraging cutting-edge vision-language models for detailed and accurate descriptions. To ensure data integrity, we rigorously analyze our dataset for problematic content, including child sexual abuse material (CSAM), personally identifiable information (PII), and toxicity. We also provide valuable metadata such as watermark presence and aesthetic scores, aiding in further dataset filtering. We hope PixelProse will be a valuable resource for future vision-language research. PixelProse is available at https://huggingface.co/datasets/tomg-group-umd/pixelprose
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Behavioral analysis in immersive learning environments: A systematic literature review and research agenda
Authors:
Yu Liu,
Kang Yue,
Yue Liu
Abstract:
The rapid growth of immersive technologies in educational areas has increased research interest in analyzing the specific behavioral patterns of learners in immersive learning environments. Considering the fact that research on the technical affordances of immersive technologies and the pedagogical affordances of behavioral analysis remains fragmented, this study first contributes by developing a…
▽ More
The rapid growth of immersive technologies in educational areas has increased research interest in analyzing the specific behavioral patterns of learners in immersive learning environments. Considering the fact that research on the technical affordances of immersive technologies and the pedagogical affordances of behavioral analysis remains fragmented, this study first contributes by developing a conceptual framework that amalgamates learning requirements, specification, evaluation, and iteration into an integrated model to identify learning benefits and potential hurdles of behavioral analysis in immersive learning environments. Then, a systematic review was conducted underpinning the proposed conceptual framework to retrieve valuable empirical evidence from the 40 eligible articles during the last decade. The review findings suggest that (1) there is an essential need to sufficiently prepare the salient pedagogical requirements to define the specific learning stage, envisage intended cognitive objectives, and specify an appropriate set of learning activities, when developing comprehensive plans on behavioral analysis in immersive learning environments. (2) Researchers could customize the unique immersive experimental implementation by considering factors from four dimensions: learner, pedagogy, context, and representation. (3) The behavioral patterns constructed in immersive learning environments vary by considering the influence of behavioral analysis techniques, research themes, and immersive technical features. (4) The use of behavioral analysis in immersive learning environments faces several challenges from technical, implementation, and data processing perspectives. This study also articulates critical research agenda that could drive future investigation on behavioral analysis in immersive learning environments.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Inference for Heterogeneous Graphical Models using Doubly High-Dimensional Linear-Mixed Models
Authors:
Kun Yue,
Eardi Lila,
Ali Shojaie
Abstract:
Motivated by the problem of inferring the graph structure of functional connectivity networks from multi-level functional magnetic resonance imaging data, we develop a valid inference framework for high-dimensional graphical models that accounts for group-level heterogeneity. We introduce a neighborhood-based method to learn the graph structure and reframe the problem as that of inferring fixed ef…
▽ More
Motivated by the problem of inferring the graph structure of functional connectivity networks from multi-level functional magnetic resonance imaging data, we develop a valid inference framework for high-dimensional graphical models that accounts for group-level heterogeneity. We introduce a neighborhood-based method to learn the graph structure and reframe the problem as that of inferring fixed effect parameters in a doubly high-dimensional linear mixed model. Specifically, we propose a LASSO-based estimator and a de-biased LASSO-based inference framework for the fixed effect parameters in the doubly high-dimensional linear mixed model, leveraging random matrix theory to deal with challenges induced by the identical fixed and random effect design matrices arising in our setting. Moreover, we introduce consistent estimators for the variance components to identify subject-specific edges in the inferred graph. To illustrate the generality of the proposed approach, we also adapt our method to account for serial correlation by learning heterogeneous graphs in the setting of a vector autoregressive model. We demonstrate the performance of the proposed framework using real data and benchmark simulation studies.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Neutron radius determination of 133Cs and its impact on the interpretation of CEvNS-CsI measurement
Authors:
Y. Huang,
S. Y. Xia,
Y. F. Li,
X. L. Tu,
J. T. Zhang,
C. J. Shao,
K. Yue,
P. Ma,
Y. F. Niu,
Z. P. Li,
Y. Kuang,
X. Q. Liu,
J. F. Han,
P. Egelhof,
Yu. A. Litvinov,
M. Wang,
Y. H. Zhang,
X. H. Zhou,
Z. Y. Sun
Abstract:
Proton-$^{133}$Cs elastic scattering at low momentum transfer is performed using an in-ring reaction technique at the Cooler Storage Ring at the Heavy Ion Research Facility in Lanzhou. Recoil protons from the elastic collisions between the internal H$_2$-gas target and the circulating $^{133}$Cs ions at 199.4 MeV/u are detected by a silicon-strip detector. The matter radius of $^{133}$Cs is deduce…
▽ More
Proton-$^{133}$Cs elastic scattering at low momentum transfer is performed using an in-ring reaction technique at the Cooler Storage Ring at the Heavy Ion Research Facility in Lanzhou. Recoil protons from the elastic collisions between the internal H$_2$-gas target and the circulating $^{133}$Cs ions at 199.4 MeV/u are detected by a silicon-strip detector. The matter radius of $^{133}$Cs is deduced by describing the measured differential cross sections using the Glauber model. Employing the adopted proton distribution radius, a point-neutron radius of 4.86(21) fm for $^{133}$Cs is obtained. With the newly determined neutron radius, the weak mixing angle sin$^2 θ_W$ is independently extracted to be 0.227(28) by fitting the coherent elastic neutrino-nucleus scattering data. Our work limits the sin$^2 θ_W$ value in a range smaller than the ones proposed by the previous independent approaches, and would play an important role in searching new physics via the high precision CE$ν$NS-CsI cross section data in the near future.
△ Less
Submitted 8 April, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
TernaryVote: Differentially Private, Communication Efficient, and Byzantine Resilient Distributed Optimization on Heterogeneous Data
Authors:
Richeng Jin,
Yujie Gu,
Kai Yue,
Xiaofan He,
Zhaoyang Zhang,
Huaiyu Dai
Abstract:
Distributed training of deep neural networks faces three critical challenges: privacy preservation, communication efficiency, and robustness to fault and adversarial behaviors. Although significant research efforts have been devoted to addressing these challenges independently, their synthesis remains less explored. In this paper, we propose TernaryVote, which combines a ternary compressor and the…
▽ More
Distributed training of deep neural networks faces three critical challenges: privacy preservation, communication efficiency, and robustness to fault and adversarial behaviors. Although significant research efforts have been devoted to addressing these challenges independently, their synthesis remains less explored. In this paper, we propose TernaryVote, which combines a ternary compressor and the majority vote mechanism to realize differential privacy, gradient compression, and Byzantine resilience simultaneously. We theoretically quantify the privacy guarantee through the lens of the emerging f-differential privacy (DP) and the Byzantine resilience of the proposed algorithm. Particularly, in terms of privacy guarantees, compared to the existing sign-based approach StoSign, the proposed method improves the dimension dependence on the gradient size and enjoys privacy amplification by mini-batch sampling while ensuring a comparable convergence rate. We also prove that TernaryVote is robust when less than 50% of workers are blind attackers, which matches that of SIGNSGD with majority vote. Extensive experimental results validate the effectiveness of the proposed algorithm.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Two Simple Principles for Diffusion-Based Test-Time Adaptation
Authors:
Kaiyu Song,
Hanjiang Lai,
Yan Pan,
Kun Yue,
Jian Yin
Abstract:
Recently, diffusion-based test-time adaptations (TTA) have shown great advances, which leverage a diffusion model to map the images in the unknown test domain to the training domain. The unseen and diverse test domains make diffusion-based TTA an ill-posed problem. In this paper, we unravel two simple principles of the design tricks for diffusion-based methods. Intuitively, \textit{Principle 1} sa…
▽ More
Recently, diffusion-based test-time adaptations (TTA) have shown great advances, which leverage a diffusion model to map the images in the unknown test domain to the training domain. The unseen and diverse test domains make diffusion-based TTA an ill-posed problem. In this paper, we unravel two simple principles of the design tricks for diffusion-based methods. Intuitively, \textit{Principle 1} says semantic similarity preserving. We should preserve the semantic similarity between the original and generated test images. \textit{Principle 2} suggests minimal modifications. This principle enables the diffusion to map the test images to the training domain with minimal modifications of the test images. In particular, following the two principles, we propose our simple yet effective principle-guided diffusion-based test-time adaptation method (PDDA). Concretely, following Principle 1, we propose a semantic keeper, the method to preserve feature similarity, where the semantic keeper could filter the corruption introduced from the test domain, thus better preserving the semantics. Following Principle 2, we propose a modification keeper, where we introduce a regularization constraint into the generative process to minimize modifications to the test image. Meanwhile, there is a hidden conflict between the two principles. We further introduce the gradient-based view to unify the direction generated from two principles. Extensive experiments on CIFAR-10C, CIFAR-100C, ImageNet-W, and ImageNet-C with WideResNet-28-10, ResNet-50, Swin-T, and ConvNext-T demonstrate that PDDA significantly performs better than the complex state-of-the-art baselines. Specifically, PDDA achieves 2.4\% average accuracy improvements in ImageNet-C without any training process.
△ Less
Submitted 11 March, 2025; v1 submitted 7 December, 2023;
originally announced December 2023.
-
Object Recognition as Next Token Prediction
Authors:
Kaiyu Yue,
Bor-Chun Chen,
Jonas Geiping,
Hengduo Li,
Tom Goldstein,
Ser-Nam Lim
Abstract:
We present an approach to pose object recognition as next token prediction. The idea is to apply a language decoder that auto-regressively predicts the text tokens from image embeddings to form labels. To ground this prediction process in auto-regression, we customize a non-causal attention mask for the decoder, incorporating two key features: modeling tokens from different labels to be independen…
▽ More
We present an approach to pose object recognition as next token prediction. The idea is to apply a language decoder that auto-regressively predicts the text tokens from image embeddings to form labels. To ground this prediction process in auto-regression, we customize a non-causal attention mask for the decoder, incorporating two key features: modeling tokens from different labels to be independent, and treating image tokens as a prefix. This masking mechanism inspires an efficient method - one-shot sampling - to simultaneously sample tokens of multiple labels in parallel and rank generated labels by their probabilities during inference. To further enhance the efficiency, we propose a simple strategy to construct a compact decoder by simply discarding the intermediate blocks of a pretrained language model. This approach yields a decoder that matches the full model's performance while being notably more efficient. The code is available at https://github.com/kaiyuyue/nxtp
△ Less
Submitted 31 March, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
-
FetusMapV2: Enhanced Fetal Pose Estimation in 3D Ultrasound
Authors:
Chaoyu Chen,
Xin Yang,
Yuhao Huang,
Wenlong Shi,
Yan Cao,
Mingyuan Luo,
Xindi Hu,
Lei Zhue,
Lequan Yu,
Kejuan Yue,
Yuanji Zhang,
Yi Xiong,
Dong Ni,
Weijun Huang
Abstract:
Fetal pose estimation in 3D ultrasound (US) involves identifying a set of associated fetal anatomical landmarks. Its primary objective is to provide comprehensive information about the fetus through landmark connections, thus benefiting various critical applications, such as biometric measurements, plane localization, and fetal movement monitoring. However, accurately estimating the 3D fetal pose…
▽ More
Fetal pose estimation in 3D ultrasound (US) involves identifying a set of associated fetal anatomical landmarks. Its primary objective is to provide comprehensive information about the fetus through landmark connections, thus benefiting various critical applications, such as biometric measurements, plane localization, and fetal movement monitoring. However, accurately estimating the 3D fetal pose in US volume has several challenges, including poor image quality, limited GPU memory for tackling high dimensional data, symmetrical or ambiguous anatomical structures, and considerable variations in fetal poses. In this study, we propose a novel 3D fetal pose estimation framework (called FetusMapV2) to overcome the above challenges. Our contribution is three-fold. First, we propose a heuristic scheme that explores the complementary network structure-unconstrained and activation-unreserved GPU memory management approaches, which can enlarge the input image resolution for better results under limited GPU memory. Second, we design a novel Pair Loss to mitigate confusion caused by symmetrical and similar anatomical structures. It separates the hidden classification task from the landmark localization task and thus progressively eases model learning. Last, we propose a shape priors-based self-supervised learning by selecting the relatively stable landmarks to refine the pose online. Extensive experiments and diverse applications on a large-scale fetal US dataset including 1000 volumes with 22 landmarks per volume demonstrate that our method outperforms other strong competitors.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
AIR: Threats of Adversarial Attacks on Deep Learning-Based Information Recovery
Authors:
Jinyin Chen,
Jie Ge,
Shilian Zheng,
Linhui Ye,
Haibin Zheng,
Weiguo Shen,
Keqiang Yue,
Xiaoniu Yang
Abstract:
A wireless communications system usually consists of a transmitter which transmits the information and a receiver which recovers the original information from the received distorted signal. Deep learning (DL) has been used to improve the performance of the receiver in complicated channel environments and state-of-the-art (SOTA) performance has been achieved. However, its robustness has not been in…
▽ More
A wireless communications system usually consists of a transmitter which transmits the information and a receiver which recovers the original information from the received distorted signal. Deep learning (DL) has been used to improve the performance of the receiver in complicated channel environments and state-of-the-art (SOTA) performance has been achieved. However, its robustness has not been investigated. In order to evaluate the robustness of DL-based information recovery models under adversarial circumstances, we investigate adversarial attacks on the SOTA DL-based information recovery model, i.e., DeepReceiver. We formulate the problem as an optimization problem with power and peak-to-average power ratio (PAPR) constraints. We design different adversarial attack methods according to the adversary's knowledge of DeepReceiver's model and/or testing samples. Extensive experiments show that the DeepReceiver is vulnerable to the designed attack methods in all of the considered scenarios. Even in the scenario of both model and test sample restricted, the adversary can attack the DeepReceiver and increase its bit error rate (BER) above 10%. It can also be found that the DeepReceiver is vulnerable to adversarial perturbations even with very low power and limited PAPR. These results suggest that defense measures should be taken to enhance the robustness of DeepReceiver.
△ Less
Submitted 17 August, 2023;
originally announced September 2023.
-
A Game-Theoretic Framework for AI Governance
Authors:
Na Zhang,
Kun Yue,
Chao Fang
Abstract:
As a transformative general-purpose technology, AI has empowered various industries and will continue to shape our lives through ubiquitous applications. Despite the enormous benefits from wide-spread AI deployment, it is crucial to address associated downside risks and therefore ensure AI advances are safe, fair, responsible, and aligned with human values. To do so, we need to establish effective…
▽ More
As a transformative general-purpose technology, AI has empowered various industries and will continue to shape our lives through ubiquitous applications. Despite the enormous benefits from wide-spread AI deployment, it is crucial to address associated downside risks and therefore ensure AI advances are safe, fair, responsible, and aligned with human values. To do so, we need to establish effective AI governance. In this work, we show that the strategic interaction between the regulatory agencies and AI firms has an intrinsic structure reminiscent of a Stackelberg game, which motivates us to propose a game-theoretic modeling framework for AI governance. In particular, we formulate such interaction as a Stackelberg game composed of a leader and a follower, which captures the underlying game structure compared to its simultaneous play counterparts. Furthermore, the choice of the leader naturally gives rise to two settings. And we demonstrate that our proposed model can serves as a unified AI governance framework from two aspects: firstly we can map one setting to the AI governance of civil domains and the other to the safety-critical and military domains, secondly, the two settings of governance could be chosen contingent on the capability of the intelligent systems. To the best of our knowledge, this work is the first to use game theory for analyzing and structuring AI governance. We also discuss promising directions and hope this can help stimulate research interest in this interdisciplinary area. On a high, we hope this work would contribute to develop a new paradigm for technology policy: the quantitative and AI-driven methods for the technology policy field, which holds significant promise for overcoming many shortcomings of existing qualitative approaches.
△ Less
Submitted 24 May, 2023;
originally announced May 2023.
-
Segment Anything Model for Medical Images?
Authors:
Yuhao Huang,
Xin Yang,
Lian Liu,
Han Zhou,
Ao Chang,
Xinrui Zhou,
Rusi Chen,
Junxuan Yu,
Jiongquan Chen,
Chaoyu Chen,
Sijing Liu,
Haozhe Chi,
Xindi Hu,
Kejuan Yue,
Lei Li,
Vicente Grau,
Deng-Ping Fan,
Fajin Dong,
Dong Ni
Abstract:
The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It has achieved impressive results on various natural image segmentation tasks. However, medical image segmentation (MIS) is more challenging because of the complex modalities, fine anatomical structures, uncertain and complex object boundaries, and wide-range object scales. To fully validate SAM's perfo…
▽ More
The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It has achieved impressive results on various natural image segmentation tasks. However, medical image segmentation (MIS) is more challenging because of the complex modalities, fine anatomical structures, uncertain and complex object boundaries, and wide-range object scales. To fully validate SAM's performance on medical data, we collected and sorted 53 open-source datasets and built a large medical segmentation dataset with 18 modalities, 84 objects, 125 object-modality paired targets, 1050K 2D images, and 6033K masks. We comprehensively analyzed different models and strategies on the so-called COSMOS 1050K dataset. Our findings mainly include the following: 1) SAM showed remarkable performance in some specific objects but was unstable, imperfect, or even totally failed in other situations. 2) SAM with the large ViT-H showed better overall performance than that with the small ViT-B. 3) SAM performed better with manual hints, especially box, than the Everything mode. 4) SAM could help human annotation with high labeling quality and less time. 5) SAM was sensitive to the randomness in the center point and tight box prompts, and may suffer from a serious performance drop. 6) SAM performed better than interactive methods with one or a few points, but will be outpaced as the number of points increases. 7) SAM's performance correlated to different factors, including boundary complexity, intensity differences, etc. 8) Finetuning the SAM on specific medical tasks could improve its average DICE performance by 4.39% and 6.68% for ViT-B and ViT-H, respectively. We hope that this comprehensive report can help researchers explore the potential of SAM applications in MIS, and guide how to appropriately use and develop SAM.
△ Less
Submitted 17 January, 2024; v1 submitted 28 April, 2023;
originally announced April 2023.
-
Machine-Learning Recognition of Dzyaloshinskii-Moriya Interaction from Magnetometry
Authors:
Bradley J. Fugetta,
Zhijie Chen,
Dhritiman Bhattacharya,
Kun Yue,
Kai Liu,
Amy Y. Liu,
Gen Yin
Abstract:
The Dzyaloshinskii-Moriya interaction (DMI), which is the antisymmetric part of the exchange interaction between neighboring local spins, winds the spin manifold and can stabilize non-trivial topological spin textures. Since topology is a robust information carrier, characterization techniques that can extract the DMI magnitude are important for the discovery and optimization of spintronic materia…
▽ More
The Dzyaloshinskii-Moriya interaction (DMI), which is the antisymmetric part of the exchange interaction between neighboring local spins, winds the spin manifold and can stabilize non-trivial topological spin textures. Since topology is a robust information carrier, characterization techniques that can extract the DMI magnitude are important for the discovery and optimization of spintronic materials. Existing experimental techniques for quantitative determination of DMI, such as high-resolution magnetic imaging of spin textures and measurement of magnon or transport properties, are time consuming and require specialized instrumentation. Here we show that a convolutional neural network can extract the DMI magnitude from minor hysteresis loops, or magnetic "fingerprints" of a material. These hysteresis loops are readily available by conventional magnetometry measurements. This provides a convenient tool to investigate topological spin textures for next-generation information processing.
△ Less
Submitted 31 August, 2023; v1 submitted 12 April, 2023;
originally announced April 2023.
-
Cancer Subtyping by Improved Transcriptomic Features Using Vector Quantized Variational Autoencoder
Authors:
Zheng Chen,
Ziwei Yang,
Lingwei Zhu,
Guang Shi,
Kun Yue,
Takashi Matsubara,
Shigehiko Kanaya,
MD Altaf-Ul-Amin
Abstract:
Defining and separating cancer subtypes is essential for facilitating personalized therapy modality and prognosis of patients. The definition of subtypes has been constantly recalibrated as a result of our deepened understanding. During this recalibration, researchers often rely on clustering of cancer data to provide an intuitive visual reference that could reveal the intrinsic characteristics of…
▽ More
Defining and separating cancer subtypes is essential for facilitating personalized therapy modality and prognosis of patients. The definition of subtypes has been constantly recalibrated as a result of our deepened understanding. During this recalibration, researchers often rely on clustering of cancer data to provide an intuitive visual reference that could reveal the intrinsic characteristics of subtypes. The data being clustered are often omics data such as transcriptomics that have strong correlations to the underlying biological mechanism. However, while existing studies have shown promising results, they suffer from issues associated with omics data: sample scarcity and high dimensionality. As such, existing methods often impose unrealistic assumptions to extract useful features from the data while avoiding overfitting to spurious correlations. In this paper, we propose to leverage a recent strong generative model, Vector Quantized Variational AutoEncoder (VQ-VAE), to tackle the data issues and extract informative latent features that are crucial to the quality of subsequent clustering by retaining only information relevant to reconstructing the input. VQ-VAE does not impose strict assumptions and hence its latent features are better representations of the input, capable of yielding superior clustering performance with any mainstream clustering method. Extensive experiments and medical analysis on multiple datasets comprising 10 distinct cancers demonstrate the VQ-VAE clustering results can significantly and robustly improve prognosis over prevalent subtyping systems.
△ Less
Submitted 20 July, 2022;
originally announced July 2022.
-
Gradient Obfuscation Gives a False Sense of Security in Federated Learning
Authors:
Kai Yue,
Richeng Jin,
Chau-Wai Wong,
Dror Baron,
Huaiyu Dai
Abstract:
Federated learning has been proposed as a privacy-preserving machine learning framework that enables multiple clients to collaborate without sharing raw data. However, client privacy protection is not guaranteed by design in this framework. Prior work has shown that the gradient sharing strategies in federated learning can be vulnerable to data reconstruction attacks. In practice, though, clients…
▽ More
Federated learning has been proposed as a privacy-preserving machine learning framework that enables multiple clients to collaborate without sharing raw data. However, client privacy protection is not guaranteed by design in this framework. Prior work has shown that the gradient sharing strategies in federated learning can be vulnerable to data reconstruction attacks. In practice, though, clients may not transmit raw gradients considering the high communication cost or due to privacy enhancement requirements. Empirical studies have demonstrated that gradient obfuscation, including intentional obfuscation via gradient noise injection and unintentional obfuscation via gradient compression, can provide more privacy protection against reconstruction attacks. In this work, we present a new data reconstruction attack framework targeting the image classification task in federated learning. We show that commonly adopted gradient postprocessing procedures, such as gradient quantization, gradient sparsification, and gradient perturbation, may give a false sense of security in federated learning. Contrary to prior studies, we argue that privacy enhancement should not be treated as a byproduct of gradient compression. Additionally, we design a new method under the proposed framework to reconstruct the image at the semantic level. We quantify the semantic privacy leakage and compare with conventional based on image similarity scores. Our comparisons challenge the image data leakage evaluation schemes in the literature. The results emphasize the importance of revisiting and redesigning the privacy protection mechanisms for client data in existing federated learning algorithms.
△ Less
Submitted 13 October, 2022; v1 submitted 8 June, 2022;
originally announced June 2022.
-
Ptychopy: GPU framework for ptychographic data analysis
Authors:
Ke Yue,
Junjing Deng,
Yi Jiang,
Youssef Nashed,
David Vine,
Stefan Vogt
Abstract:
X-ray ptychography imaging at synchrotron facilities like the Advanced Photon Source (APS) involves controlling instrument hardwares to collect a set of diffraction patterns from overlapping coherent illumination spots on extended samples, managing data storage, reconstructing ptychographic images from acquired diffraction patterns, and providing the visualization of results and feedback. In addit…
▽ More
X-ray ptychography imaging at synchrotron facilities like the Advanced Photon Source (APS) involves controlling instrument hardwares to collect a set of diffraction patterns from overlapping coherent illumination spots on extended samples, managing data storage, reconstructing ptychographic images from acquired diffraction patterns, and providing the visualization of results and feedback. In addition to the complicated workflow, ptychography instrument could produce up to several TB's of data per second that is needed to be processed in real time. This brings up the need to develop a high performance, robust and user friendly processing software package for ptychographic data analysis. In this paper we present a software framework which provides functionality of visualization, work flow control, and data reconstruction. To accelerate the computation and large datasets process, the data reconstruction part is implemented with three algorithms, ePIE, DM and LSQML using CUDA-C on GPU.
△ Less
Submitted 24 January, 2022;
originally announced February 2022.
-
Accelerating Laue Depth Reconstruction Algorithm with CUDA
Authors:
Ke Yue,
Schwarz Nicholas,
Tischler Jonathan Z
Abstract:
The Laue diffraction microscopy experiment uses the polychromatic Laue micro-diffraction technique to examine the structure of materials with sub-micron spatial resolution in all three dimensions. During this experiment, local crystallographic orientations, orientation gradients and strains are measured as properties which will be recorded in HDF5 image format. The recorded images will be processe…
▽ More
The Laue diffraction microscopy experiment uses the polychromatic Laue micro-diffraction technique to examine the structure of materials with sub-micron spatial resolution in all three dimensions. During this experiment, local crystallographic orientations, orientation gradients and strains are measured as properties which will be recorded in HDF5 image format. The recorded images will be processed with a depth reconstruction algorithm for future data analysis. But the current depth reconstruction algorithm consumes considerable processing time and might take up to 2 weeks for reconstructing data collected from one single experiment. To improve the depth reconstruction computation speed, we propose a scalable GPU program solution on the depth reconstruction problem in this paper. The test result shows that the running time would be 10 to 20 times faster than the prior CPU design for various size of input data.
△ Less
Submitted 20 January, 2022;
originally announced January 2022.
-
Neural Tangent Kernel Empowered Federated Learning
Authors:
Kai Yue,
Richeng Jin,
Ryan Pilgrim,
Chau-Wai Wong,
Dror Baron,
Huaiyu Dai
Abstract:
Federated learning (FL) is a privacy-preserving paradigm where multiple participants jointly solve a machine learning problem without sharing raw data. Unlike traditional distributed learning, a unique characteristic of FL is statistical heterogeneity, namely, data distributions across participants are different from each other. Meanwhile, recent advances in the interpretation of neural networks h…
▽ More
Federated learning (FL) is a privacy-preserving paradigm where multiple participants jointly solve a machine learning problem without sharing raw data. Unlike traditional distributed learning, a unique characteristic of FL is statistical heterogeneity, namely, data distributions across participants are different from each other. Meanwhile, recent advances in the interpretation of neural networks have seen a wide use of neural tangent kernels (NTKs) for convergence analyses. In this paper, we propose a novel FL paradigm empowered by the NTK framework. The paradigm addresses the challenge of statistical heterogeneity by transmitting update data that are more expressive than those of the conventional FL paradigms. Specifically, sample-wise Jacobian matrices, rather than model weights/gradients, are uploaded by participants. The server then constructs an empirical kernel matrix to update a global model without explicitly performing gradient descent. We further develop a variant with improved communication efficiency and enhanced privacy. Numerical results show that the proposed paradigm can achieve the same accuracy while reducing the number of communication rounds by an order of magnitude compared to federated averaging.
△ Less
Submitted 13 June, 2022; v1 submitted 7 October, 2021;
originally announced October 2021.
-
Federated Learning via Plurality Vote
Authors:
Kai Yue,
Richeng Jin,
Chau-Wai Wong,
Huaiyu Dai
Abstract:
Federated learning allows collaborative workers to solve a machine learning problem while preserving data privacy. Recent studies have tackled various challenges in federated learning, but the joint optimization of communication overhead, learning reliability, and deployment efficiency is still an open problem. To this end, we propose a new scheme named federated learning via plurality vote (FedVo…
▽ More
Federated learning allows collaborative workers to solve a machine learning problem while preserving data privacy. Recent studies have tackled various challenges in federated learning, but the joint optimization of communication overhead, learning reliability, and deployment efficiency is still an open problem. To this end, we propose a new scheme named federated learning via plurality vote (FedVote). In each communication round of FedVote, workers transmit binary or ternary weights to the server with low communication overhead. The model parameters are aggregated via weighted voting to enhance the resilience against Byzantine attacks. When deployed for inference, the model with binary or ternary weights is resource-friendly to edge devices. We show that our proposed method can reduce quantization error and converges faster compared with the methods directly quantizing the model updates.
△ Less
Submitted 9 December, 2022; v1 submitted 6 October, 2021;
originally announced October 2021.
-
Systematic trends of neutron skin thickness versus relative neutron excess
Authors:
J. T. Zhang,
X. L. Tu,
P. Sarriguren,
K. Yue,
Q. Zeng,
Z. Y. Sun,
M. Wang,
Y. H. Zhang,
X. H. Zhou,
Yu. A. Litvinov
Abstract:
Available experimental neutron skin thicknesses of even-even stable Ca, Ni, Sn, Pb, and Cd isotopes are evaluated, and separate trends of neutron skin thickness versus relative neutron excess $δ=(N-Z)/A$ are firstly observed for different isotopic chains. This phenomenon is quantitatively reproduced by the deformed Skyrme Hartree-Fock $+$ BCS model with SLy4 force.
Available experimental neutron skin thicknesses of even-even stable Ca, Ni, Sn, Pb, and Cd isotopes are evaluated, and separate trends of neutron skin thickness versus relative neutron excess $δ=(N-Z)/A$ are firstly observed for different isotopic chains. This phenomenon is quantitatively reproduced by the deformed Skyrme Hartree-Fock $+$ BCS model with SLy4 force.
△ Less
Submitted 7 September, 2021;
originally announced September 2021.
-
Communication-Efficient Federated Learning via Predictive Coding
Authors:
Kai Yue,
Richeng Jin,
Chau-Wai Wong,
Huaiyu Dai
Abstract:
Federated learning can enable remote workers to collaboratively train a shared machine learning model while allowing training data to be kept locally. In the use case of wireless mobile devices, the communication overhead is a critical bottleneck due to limited power and bandwidth. Prior work has utilized various data compression tools such as quantization and sparsification to reduce the overhead…
▽ More
Federated learning can enable remote workers to collaboratively train a shared machine learning model while allowing training data to be kept locally. In the use case of wireless mobile devices, the communication overhead is a critical bottleneck due to limited power and bandwidth. Prior work has utilized various data compression tools such as quantization and sparsification to reduce the overhead. In this paper, we propose a predictive coding based compression scheme for federated learning. The scheme has shared prediction functions among all devices and allows each worker to transmit a compressed residual vector derived from the reference. In each communication round, we select the predictor and quantizer based on the rate-distortion cost, and further reduce the redundancy with entropy coding. Extensive simulations reveal that the communication cost can be reduced up to 99% with even better learning performance when compared with other baseline methods.
△ Less
Submitted 8 January, 2022; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Tree species effects on topsoil carbon stock and concentration are mediated by tree species type, mycorrhizal association, and N-fixing ability at the global scale
Authors:
Yan Peng,
Inger Kappel Schmidt,
Haifeng Zheng,
Petr Heděnec,
Luciana Ruggiero Bachega,
Kai Yue,
Fuzhong Wu,
Lars Vesterdal
Abstract:
Selection of appropriate tree species is an important forest management decision that may affect sequestration of carbon (C) in soil. However, information about tree species effects on soil C stocks at the global scale remains unclear. Here, we quantitatively synthesized 850 observations from field studies that were conducted in a common garden or monoculture plantations to assess how tree species…
▽ More
Selection of appropriate tree species is an important forest management decision that may affect sequestration of carbon (C) in soil. However, information about tree species effects on soil C stocks at the global scale remains unclear. Here, we quantitatively synthesized 850 observations from field studies that were conducted in a common garden or monoculture plantations to assess how tree species type (broadleaf vs. conifer), mycorrhizal association (arbuscular mycorrhizal (AM) vs. ectomycorrhizal (ECM)), and N-fixing ability (N-fixing vs. non-N-fixing), directly and indirectly, affect topsoil (with a median depth of 10 cm) C concentration and stock, and how such effects were influenced by environmental factors such as geographical location and climate. We found that (1) tree species type, mycorrhizal association, and N-fixing ability were all important factors affecting soil C, with lower forest floor C stocks under broadleaved (44%), AM (39%), or N-fixing (28%) trees respectively, but higher mineral soil C concentration (11%, 22%, and 156%) and stock (9%, 10%, and 6%) under broadleaved, AM, and N-fixing trees respectively; (2) tree species type, mycorrhizal association, and N-fixing ability affected forest floor C stock and mineral soil C concentration and stock directly or indirectly through impacting soil properties such as microbial biomass C and nitrogen; (3) tree species effects on mineral soil C concentration and stock were mediated by latitude, MAT, MAP, and forest stand age. These results reveal how tree species and their specific traits influence forest floor C stock and mineral soil C concentration and stock at a global scale. Insights into the underlying mechanisms of tree species effects found in our study would be useful to inform tree species selection in forest management or afforestation aiming to sequester more atmospheric C in soil for mitigation of climate change.
△ Less
Submitted 25 November, 2020; v1 submitted 7 November, 2020;
originally announced November 2020.
-
Measurement of $^{58}$Ni($p$, $p$)$^{58}$Ni elastic scattering at low momentum transfer by using the HIRFL-CSR heavy-ion storage ring
Authors:
K. Yue,
J. T. Zhang,
X. L. Tu,
C. J. Shao,
H. X. Li,
P. Ma,
B. Mei,
X. C. Chen,
Y. Y. Yang,
X. Q. Liu,
Y. M. Xing,
K. H. Fang,
X. H. Li,
Z. Y. Sun,
M. Wang,
P. Egelhof,
Yu. A. Litvinov,
K. Blaum,
Y. H. Zhang,
X. H. Zhou
Abstract:
The very first in-ring reaction experiment at the HIRFL-CSR heavy-ion storage ring, namely proton elastic scattering on stable $^{58}$Ni nuclei, is presented. The circulating $^{58}$Ni$^{19+}$ ions with an energy of 95 MeV/u were interacting repeatedly with an internal hydrogen gas target in the CSRe experimental ring. Low energy proton recoils from the elastic collisions were measured with an ult…
▽ More
The very first in-ring reaction experiment at the HIRFL-CSR heavy-ion storage ring, namely proton elastic scattering on stable $^{58}$Ni nuclei, is presented. The circulating $^{58}$Ni$^{19+}$ ions with an energy of 95 MeV/u were interacting repeatedly with an internal hydrogen gas target in the CSRe experimental ring. Low energy proton recoils from the elastic collisions were measured with an ultra-high vacuum compatible silicon-strip detector. Deduced differential cross sections were normalized by measuring K-shell X-rays from $^{58}$Ni$^{19+}$ projectiles due to the $^{58}$Ni$^{19+}$-H$_2$ ionization collisions. Compared to the experimental cross sections, a good agreement has been achieved with the theoretical predictions in the measured region, which were obtained by using the global phenomenological optical model potentials. Our results enable new research opportunities for optical model potential studies on exotic nuclides by using the in-ring reaction setup at the HIRFL-CSR facility.
△ Less
Submitted 26 October, 2020;
originally announced October 2020.
-
Visible Feature Guidance for Crowd Pedestrian Detection
Authors:
Zhida Huang,
Kaiyu Yue,
Jiangfan Deng,
Feng Zhou
Abstract:
Heavy occlusion and dense gathering in crowd scene make pedestrian detection become a challenging problem, because it's difficult to guess a precise full bounding box according to the invisible human part. To crack this nut, we propose a mechanism called Visible Feature Guidance (VFG) for both training and inference. During training, we adopt visible feature to regress the simultaneous outputs of…
▽ More
Heavy occlusion and dense gathering in crowd scene make pedestrian detection become a challenging problem, because it's difficult to guess a precise full bounding box according to the invisible human part. To crack this nut, we propose a mechanism called Visible Feature Guidance (VFG) for both training and inference. During training, we adopt visible feature to regress the simultaneous outputs of visible bounding box and full bounding box. Then we perform NMS only on visible bounding boxes to achieve the best fitting full box in inference. This manner can alleviate the incapable influence brought by NMS in crowd scene and make full bounding box more precisely. Furthermore, in order to ease feature association in the post application process, such as pedestrian tracking, we apply Hungarian algorithm to associate parts for a human instance. Our proposed method can stably bring about 2~3% improvements in mAP and AP50 for both two-stage and one-stage detector. It's also more effective for MR-2 especially with the stricter IoU. Experiments on Crowdhuman, Cityperson, Caltech and KITTI datasets show that visible feature guidance can help detector achieve promisingly better performances. Moreover, parts association produces a strong benchmark on Crowdhuman for the vision community.
△ Less
Submitted 16 September, 2020; v1 submitted 23 August, 2020;
originally announced August 2020.
-
Matching Guided Distillation
Authors:
Kaiyu Yue,
Jiangfan Deng,
Feng Zhou
Abstract:
Feature distillation is an effective way to improve the performance for a smaller student model, which has fewer parameters and lower computation cost compared to the larger teacher model. Unfortunately, there is a common obstacle - the gap in semantic feature structure between the intermediate features of teacher and student. The classic scheme prefers to transform intermediate features by adding…
▽ More
Feature distillation is an effective way to improve the performance for a smaller student model, which has fewer parameters and lower computation cost compared to the larger teacher model. Unfortunately, there is a common obstacle - the gap in semantic feature structure between the intermediate features of teacher and student. The classic scheme prefers to transform intermediate features by adding the adaptation module, such as naive convolutional, attention-based or more complicated one. However, this introduces two problems: a) The adaptation module brings more parameters into training. b) The adaptation module with random initialization or special transformation isn't friendly for distilling a pre-trained student. In this paper, we present Matching Guided Distillation (MGD) as an efficient and parameter-free manner to solve these problems. The key idea of MGD is to pose matching the teacher channels with students' as an assignment problem. We compare three solutions of the assignment problem to reduce channels from teacher features with partial distillation loss. The overall training takes a coordinate-descent approach between two optimization objects - assignments update and parameters update. Since MGD only contains normalization or pooling operations with negligible computation cost, it is flexible to plug into network with other distillation methods.
△ Less
Submitted 12 October, 2020; v1 submitted 23 August, 2020;
originally announced August 2020.
-
Heterogeneous Swarms for Maritime Dynamic Target Search and Tracking
Authors:
Hian Lee Kwa,
Grgur Tokić,
Roland Bouffanais,
Dick K. P. Yue
Abstract:
Current strategies employed for maritime target search and tracking are primarily based on the use of agents following a predetermined path to perform a systematic sweep of a search area. Recently, dynamic Particle Swarm Optimization (PSO) algorithms have been used together with swarming multi-robot systems (MRS), giving search and tracking solutions the added properties of robustness, scalability…
▽ More
Current strategies employed for maritime target search and tracking are primarily based on the use of agents following a predetermined path to perform a systematic sweep of a search area. Recently, dynamic Particle Swarm Optimization (PSO) algorithms have been used together with swarming multi-robot systems (MRS), giving search and tracking solutions the added properties of robustness, scalability, and flexibility. Swarming MRS also give the end-user the opportunity to incrementally upgrade the robotic system, inevitably leading to the use of heterogeneous swarming MRS. However, such systems have not been well studied and incorporating upgraded agents into a swarm may result in degraded mission performances. In this paper, we propose a PSO-based strategy using a topological k-nearest neighbor graph with tunable exploration and exploitation dynamics with an adaptive repulsion parameter. This strategy is implemented within a simulated swarm of 50 agents with varying proportions of fast agents tracking a target represented by a fictitious binary function. Through these simulations, we are able to demonstrate an increase in the swarm's collective response level and target tracking performance by substituting in a proportion of fast buoys.
△ Less
Submitted 3 August, 2020;
originally announced August 2020.
-
Employing $p+^{58}$Ni elastic scattering for determination of $K$-shell ionization cross section of $^{58}$Ni$^{19+}$ in collisions with hydrogen gas target at 95 MeV/u
Authors:
J. T. Zhang,
K. Yue,
C. J. Shao,
X. L. Tu,
Y. Y. Wang,
P. Ma,
B. Mei,
X. C. Chen,
Y. Y. Yang,
Z. Y. Sun,
M. Wang,
V. P. Shevelko,
I. Yu. Tolstikhina,
Yu. A. Litvinov,
Y. H. Zhang,
X. H. Zhou
Abstract:
We present a new experimental method for measuring inner-shell ionization cross sections of low-charged ions colliding with hydrogen gas target in a storage ring. The method is based on a calibration by the well-known differential cross sections of proton elastic scattering on nuclei. $K$-shell ionization cross section of 1047(100) barn for the 95 MeV/u $^{58}$Ni$^{19+}$ ions colliding with hydrog…
▽ More
We present a new experimental method for measuring inner-shell ionization cross sections of low-charged ions colliding with hydrogen gas target in a storage ring. The method is based on a calibration by the well-known differential cross sections of proton elastic scattering on nuclei. $K$-shell ionization cross section of 1047(100) barn for the 95 MeV/u $^{58}$Ni$^{19+}$ ions colliding with hydrogen atoms was obtained in this work. Compared to the measured ionization cross section, a good agreement is achieved with the prediction by the Relativistic Ionization CODE Modified program (RICODE-M).
△ Less
Submitted 31 May, 2020;
originally announced June 2020.
-
Quasiparticles states for integer- and fractional-charged electron wave packets
Authors:
X. K. Yue,
Y. Yin
Abstract:
It is well-known that Lorentzian voltage pulses with integer quantum flux can lead to noiseless current in quantum conductors. The current is carried by charged quasiparticles in the Fermi sea of the conductors, which have well-defined wave functions and have been named as "levitons". However, it is not clear how levitons evolve as the flux of the pulses changes continuously toward a fractional va…
▽ More
It is well-known that Lorentzian voltage pulses with integer quantum flux can lead to noiseless current in quantum conductors. The current is carried by charged quasiparticles in the Fermi sea of the conductors, which have well-defined wave functions and have been named as "levitons". However, it is not clear how levitons evolve as the flux of the pulses changes continuously toward a fractional value. To answer this question, we introduce a set of Wannier-like single-body wave functions, which can be used to describe the quantum states of the quasiparticles injected by Lorentzian pulses with arbitrary flux. We show that, by tuning the flux of the pulses, levitons can evolve into quasiparticles carrying fractional charges. In the meantime, additional fractional-charged quasiparticles can also be excited, which can form neutral electron-hole pairs. The information of these quasiparticles can be extracted from the shot noise of the current. These knowledge can be helpful for the time-resolved quantum control of propagating electrons in solid-state circuits.
△ Less
Submitted 25 February, 2021; v1 submitted 1 April, 2020;
originally announced April 2020.
-
Normal and abnormal electron-hole pairs in a voltage-pulse-driven quantum conductor
Authors:
X. K. Yue,
Y. Yin
Abstract:
Electron-hole pairs can be excited coherently in a quantum conductor by applying voltage pulses on its contact. We find that these electron-hole pairs can be classified into two kinds, whose excitation probabilities have different dependence on the Faraday flux of the pulse. Most of the pairs are of the first kind, which can be referred to as "normal" pairs. Their excitation probabilities increase…
▽ More
Electron-hole pairs can be excited coherently in a quantum conductor by applying voltage pulses on its contact. We find that these electron-hole pairs can be classified into two kinds, whose excitation probabilities have different dependence on the Faraday flux of the pulse. Most of the pairs are of the first kind, which can be referred to as "normal" pairs. Their excitation probabilities increase nearly monotonically with the flux and saturate to the maximum value 1 when the flux is large enough. In contrast, there exist "abnormal" pairs, whose excitation probabilities can exhibit oscillations with the flux. These pairs can only be excited by pulses with small width. Due to the oscillation of the probabilities, the abnormal pairs can lead to different features in the full counting statistics of the electron-hole pairs for pulses with integer and noninteger fluxes.
△ Less
Submitted 25 April, 2019;
originally announced April 2019.
-
Compact Generalized Non-local Network
Authors:
Kaiyu Yue,
Ming Sun,
Yuchen Yuan,
Feng Zhou,
Errui Ding,
Fuxin Xu
Abstract:
The non-local module is designed for capturing long-range spatio-temporal dependencies in images and videos. Although having shown excellent performance, it lacks the mechanism to model the interactions between positions across channels, which are of vital importance in recognizing fine-grained objects and actions. To address this limitation, we generalize the non-local module and take the correla…
▽ More
The non-local module is designed for capturing long-range spatio-temporal dependencies in images and videos. Although having shown excellent performance, it lacks the mechanism to model the interactions between positions across channels, which are of vital importance in recognizing fine-grained objects and actions. To address this limitation, we generalize the non-local module and take the correlations between the positions of any two channels into account. This extension utilizes the compact representation for multiple kernel functions with Taylor expansion that makes the generalized non-local module in a fast and low-complexity computation flow. Moreover, we implement our generalized non-local method within channel groups to ease the optimization. Experimental results illustrate the clear-cut improvements and practical applicability of the generalized non-local module on both fine-grained object recognition and video classification. Code is available at: https://github.com/KaiyuYue/cgnl-network.pytorch.
△ Less
Submitted 31 October, 2018; v1 submitted 31 October, 2018;
originally announced October 2018.
-
Fine-grained Video Categorization with Redundancy Reduction Attention
Authors:
Chen Zhu,
Xiao Tan,
Feng Zhou,
Xiao Liu,
Kaiyu Yue,
Errui Ding,
Yi Ma
Abstract:
For fine-grained categorization tasks, videos could serve as a better source than static images as videos have a higher chance of containing discriminative patterns. Nevertheless, a video sequence could also contain a lot of redundant and irrelevant frames. How to locate critical information of interest is a challenging task. In this paper, we propose a new network structure, known as Redundancy R…
▽ More
For fine-grained categorization tasks, videos could serve as a better source than static images as videos have a higher chance of containing discriminative patterns. Nevertheless, a video sequence could also contain a lot of redundant and irrelevant frames. How to locate critical information of interest is a challenging task. In this paper, we propose a new network structure, known as Redundancy Reduction Attention (RRA), which learns to focus on multiple discriminative patterns by sup- pressing redundant feature channels. Specifically, it firstly summarizes the video by weight-summing all feature vectors in the feature maps of selected frames with a spatio-temporal soft attention, and then predicts which channels to suppress or to enhance according to this summary with a learned non-linear transform. Suppression is achieved by modulating the feature maps and threshing out weak activations. The updated feature maps are then used in the next iteration. Finally, the video is classified based on multiple summaries. The proposed method achieves out- standing performances in multiple video classification datasets. Further- more, we have collected two large-scale video datasets, YouTube-Birds and YouTube-Cars, for future researches on fine-grained video categorization. The datasets are available at http://www.cs.umd.edu/~chenzhu/fgvc.
△ Less
Submitted 26 October, 2018;
originally announced October 2018.
-
Gradual Collective Upgrade of a Swarm of Autonomous Buoys for Dynamic Ocean Monitoring
Authors:
Francesco Vallegra,
David Mateo,
Grgur Tokić,
Roland Bouffanais,
Dick K. P. Yue
Abstract:
Swarms of autonomous surface vehicles equipped with environmental sensors and decentralized communications bring a new wave of attractive possibilities for the monitoring of dynamic features in oceans and other waterbodies. However, a key challenge in swarm robotics design is the efficient collective operation of heterogeneous systems. We present both theoretical analysis and field experiments on…
▽ More
Swarms of autonomous surface vehicles equipped with environmental sensors and decentralized communications bring a new wave of attractive possibilities for the monitoring of dynamic features in oceans and other waterbodies. However, a key challenge in swarm robotics design is the efficient collective operation of heterogeneous systems. We present both theoretical analysis and field experiments on the responsiveness in dynamic area coverage of a collective of 22 autonomous buoys, where 4 units are upgraded to a new design that allows them to move 80\% faster than the rest. This system is able to react on timescales of the minute to changes in areas on the order of a few thousand square meters. We have observed that this partial upgrade of the system significantly increases its average responsiveness, without necessarily improving the spatial uniformity of the deployment. These experiments show that the autonomous buoy designs and the cooperative control rule described in this work provide an efficient, flexible, and scalable solution for the pervasive and persistent monitoring of water environments.
△ Less
Submitted 31 August, 2018;
originally announced August 2018.
-
TreeSegNet: Adaptive Tree CNNs for Subdecimeter Aerial Image Segmentation
Authors:
Kai Yue,
Lei Yang,
Ruirui Li,
Wei Hu,
Fan Zhang,
Wei Li
Abstract:
For the task of subdecimeter aerial imagery segmentation, fine-grained semantic segmentation results are usually difficult to obtain because of complex remote sensing content and optical conditions. Recently, convolutional neural networks (CNNs) have shown outstanding performance on this task. Although many deep neural network structures and techniques have been applied to improve the accuracy, fe…
▽ More
For the task of subdecimeter aerial imagery segmentation, fine-grained semantic segmentation results are usually difficult to obtain because of complex remote sensing content and optical conditions. Recently, convolutional neural networks (CNNs) have shown outstanding performance on this task. Although many deep neural network structures and techniques have been applied to improve the accuracy, few have paid attention to better differentiating the easily confused classes. In this paper, we propose TreeSegNet which adopts an adaptive network to increase the classification rate at the pixelwise level. Specifically, based on the infrastructure of DeepUNet, a Tree-CNN block in which each node represents a ResNeXt unit is constructed adaptively according to the confusion matrix and the proposed TreeCutting algorithm. By transporting feature maps through concatenating connections, the Tree-CNN block fuses multiscale features and learns best weights for the model. In experiments on the ISPRS 2D semantic labeling Potsdam dataset, the results obtained by TreeSegNet are better than those of other published state-of-the-art methods. Detailed comparison and analysis show that the improvement brought by the adaptive Tree-CNN block is significant.
△ Less
Submitted 25 August, 2018; v1 submitted 29 April, 2018;
originally announced April 2018.
-
First application of combined isochronous and Schottky mass spectrometry: Half-lives of fully ionized 49Cr24+ and 53Fe26+ atoms
Authors:
X. L. Tu,
X. C. Chen,
J. T. Zhang,
P. Shuai,
K. Yue,
X. Xu,
C. Y. Fu,
Q. Zeng,
X. Zhou,
Y. M. Xing,
J. X. Wu,
R. S. Mao,
L. J. Mao,
K. H. Fang,
Z. Y. Sun,
M. Wang,
J. C. Yang,
Yu. A. Litvinov,
K. Blaum,
Y. H. Zhang,
Y. J. Yuan,
X. W. Ma,
X. H. Zhou,
H. S. Xu
Abstract:
Lifetime measurements of b -decaying highly charged ions have been performed in the storage ring CSRe by applying the isochronous Schottky mass spectrometry. The fully ionized 49Cr and 53Fe ions were produced in projectile fragmentation of 58Ni primary beam and were stored in the CSRe tuned into the isochronous ion-optical mode. The new resonant Schottky detector was applied to monitor the intensi…
▽ More
Lifetime measurements of b -decaying highly charged ions have been performed in the storage ring CSRe by applying the isochronous Schottky mass spectrometry. The fully ionized 49Cr and 53Fe ions were produced in projectile fragmentation of 58Ni primary beam and were stored in the CSRe tuned into the isochronous ion-optical mode. The new resonant Schottky detector was applied to monitor the intensities of stored uncooled 49Cr24+ and 53Fe26+ ions. The extracted half-lives T1/2(49Cr24+) = 44.0(27) min and T1/2(53Fe26+) = 8.47(19) min are in excellent agreement with the literature half-life values corrected for the disabled electron capture branchings. This is an important proof-of-principle step towards realizing the simultaneous mass and lifetime measurements on exotic nuclei at the future storage ring facilities.
△ Less
Submitted 8 April, 2018;
originally announced April 2018.
-
From solar cells to ocean buoys: Wide-bandwidth limits to absorption by metaparticle arrays
Authors:
M. Benzaouia,
G. Tokic,
O. D. Miller,
D. K. P. Yue,
S. G. Johnson
Abstract:
In this paper, we develop an approximate wide-bandwidth upper bound to the absorption enhancement in arrays of metaparticles, applicable to general wave-scattering problems and motivated here by ocean-buoy energy extraction. We show that general limits, including the well-known Yablonovitch result in solar cells, arise from reciprocity conditions. The use of reciprocity in the stochastic regime le…
▽ More
In this paper, we develop an approximate wide-bandwidth upper bound to the absorption enhancement in arrays of metaparticles, applicable to general wave-scattering problems and motivated here by ocean-buoy energy extraction. We show that general limits, including the well-known Yablonovitch result in solar cells, arise from reciprocity conditions. The use of reciprocity in the stochastic regime leads us to a corrected diffusion model from which we derive our main result: an analytical prediction of optimal array absorption that closely matches exact simulations for both random and optimized arrays under angle/frequency averaging. This result also enables us to propose and quantify approaches to increase performance through careful particle design and/or using external reflectors. We show in particular that the use of membranes on the water's surface allows substantial enhancement.
△ Less
Submitted 22 February, 2019; v1 submitted 2 April, 2018;
originally announced April 2018.
-
Swarm-Enabling Technology for Multi-Robot Systems
Authors:
Mohammadreza Chamanbaz,
David Mateo,
Brandon M. Zoss,
Grgur Tokić,
Erik Wilhelm,
Roland Bouffanais,
and Dick K. P. Yue
Abstract:
Swarm robotics has experienced a rapid expansion in recent years, primarily fueled by specialized multi-robot systems developed to achieve dedicated collective actions. These specialized platforms are in general designed with swarming considerations at the front and center. Key hardware and software elements required for swarming are often deeply embedded and integrated with the particular system.…
▽ More
Swarm robotics has experienced a rapid expansion in recent years, primarily fueled by specialized multi-robot systems developed to achieve dedicated collective actions. These specialized platforms are in general designed with swarming considerations at the front and center. Key hardware and software elements required for swarming are often deeply embedded and integrated with the particular system. However, given the noticeable increase in the number of low-cost mobile robots readily available, practitioners and hobbyists may start considering to assemble full-fledged swarms by minimally retrofitting such mobile platforms with a swarm-enabling technology. Here, we report one possible embodiment of such a technology designed to enable the assembly and the study of swarming in a range of general-purpose robotic systems. This is achieved by combining a modular and transferable software toolbox with a hardware suite composed of a collection of low-cost and off-the-shelf components. The developed technology can be ported to a relatively vast range of robotic platforms with minimal changes and high levels of scalability. This swarm-enabling technology has successfully been implemented on two distinct distributed multi-robot systems, a swarm of mobile marine buoys and a team of commercial terrestrial robots. We have tested the effectiveness of both of these distributed robotic systems in performing collective exploration and search scenarios, as well as other classical cooperative behaviors. Experimental results on different swarm behaviors are reported for the two platforms in uncontrolled environments and without any supporting infrastructure. The design of the associated software library allows for a seamless switch to other cooperative behaviors, and also offers the possibility to simulate newly designed collective behaviors prior to their implementation onto the platforms.
△ Less
Submitted 11 May, 2017;
originally announced May 2017.
-
Temperature dependence of the plastic scintillator detector for DAMPE
Authors:
Zhao-Min Wang,
Yu-Hong Yu,
Zhi-Yu Sun,
Ke Yue,
Duo Yan,
Yong-Jie Zhang,
Yong Zhou,
Fang Fang,
Wen-Xue Huang,
Jun-Ling Chen
Abstract:
The Plastic Scintillator Detector (PSD) is one of the main sub-detectors in the DArk Matter Particle Explorer (DAMPE) project. It will be operated over a large temperature range from -$10$ to $30^{\circ}$C, so the temperature effect of the whole detection system should be studied in detail. The temperature dependence of the PSD system is mainly contributed by the three parts: the plastic scintilla…
▽ More
The Plastic Scintillator Detector (PSD) is one of the main sub-detectors in the DArk Matter Particle Explorer (DAMPE) project. It will be operated over a large temperature range from -$10$ to $30^{\circ}$C, so the temperature effect of the whole detection system should be studied in detail. The temperature dependence of the PSD system is mainly contributed by the three parts: the plastic scintillator bar, the photomultiplier tube (PMT), and the Front End Electronics (FEE). These three parts have been studied in detail and the contribution of each part has been obtained and discussed. The temperature coefficient of the PMT is $-0.320(\pm0.033)\%/^{\circ}$C, and the coefficient of the plastic scintillator bar is $-0.036(\pm0.038)\%/^{\circ}$C. This result means that after subtracting the FEE pedestal, the variation of the signal amplitude of the PMT-scintillator system due to temperature mainly comes from the PMT, and the plastic scintillator bar is not sensitive to temperature over the operating range. Since the temperature effect cannot be ignored, the temperature dependence of the whole PSD has been also studied and a correction has been made to minimize this effect. The correction result shows that the effect of temperature on the signal amplitude of the PSD system can be suppressed.
△ Less
Submitted 11 December, 2016;
originally announced December 2016.
-
Plastic scintillation detectors for precision time-of-flight measurements of relativistic heavy ions
Authors:
Wen-Jian Lin,
Jian-Wei Zhao,
Bao-Hua Sun,
Liu-Chun He,
Wei-Ping Lin,
Chuan-Ye Liu,
Isao Tanihata,
Satoru Terashima,
Yi Tian,
Feng Wang,
Meng Wang,
Guang-Xin Zhang,
Xue-Heng Zhang,
Li-Hua Zhu,
Li-Min Duan,
Rong-Jiang Hu,
Zhong Liu,
Chen-Gui Lu,
Pei-Pei Ren,
Li-Na Sheng,
Zhi-Yu Sun,
Shi-Tao Wang,
Tao-Feng Wang,
Zhi-Guo Xu,
Duo Yan
, et al. (2 additional authors not shown)
Abstract:
Plastic scintillation detectors for Time-of-Flight (TOF) measurements are almost essential for event-by-event identification of relativistic rare isotopes. In this work, a pair of plastic scintillation detectors of 50 $\times$ 50 $\times$ 3$^{t}$ mm$^3$ and 80 $\times$ 100 $\times$ 3$^{t}$ mm$^3$ have been set up at the external target facility (ETF), Institute of Modern Physics. Their time, energ…
▽ More
Plastic scintillation detectors for Time-of-Flight (TOF) measurements are almost essential for event-by-event identification of relativistic rare isotopes. In this work, a pair of plastic scintillation detectors of 50 $\times$ 50 $\times$ 3$^{t}$ mm$^3$ and 80 $\times$ 100 $\times$ 3$^{t}$ mm$^3$ have been set up at the external target facility (ETF), Institute of Modern Physics. Their time, energy and position responses are measured with $^{18}$O primary beam at 400 MeV/nucleon. After the off-line walk-effect and position corrections, the time resolution of the two detectors are determined to be 27 ps ($σ$) and 36 ps ($σ$), respectively. Both detectors have nearly the same energy resolution of 3$\%$ ($σ$) and position resolution of 2 mm ($σ$). The detectors have been used successfully in nuclear reaction cross section measurements, and will be be employed for upgrading RIBLL2 beam line at IMP as well as for the high energy branch at HIAF.
△ Less
Submitted 27 September, 2016;
originally announced September 2016.
-
Design and construction of a multi-layer CsI(Tl) telescope for high-energy reaction studies
Authors:
D. Yan,
Z. Y. Sun,
K. Yue,
S. T. Wang,
X. H. Zhang,
Y. H. Yu,
J. L. Chen,
S. W. Tang,
F. Fang,
Y. Zhou,
Y. Sun,
Z. M. Wang,
Y. Z. Sun
Abstract:
A prototype of a new CsI(Tl) telescope, which will be used in the reaction studies of light isotopes with energy of several hundred AMeV, has been constructed and tested at the Institute of Modern Physics, Chinese Academy of Sciences. The telescope has a multi-layer structure and the range information will be obtained to improve the particle identification performance. This prototype has seven lay…
▽ More
A prototype of a new CsI(Tl) telescope, which will be used in the reaction studies of light isotopes with energy of several hundred AMeV, has been constructed and tested at the Institute of Modern Physics, Chinese Academy of Sciences. The telescope has a multi-layer structure and the range information will be obtained to improve the particle identification performance. This prototype has seven layers of different thickness. A 5.0% (FWHM) energy resolution has been extracted for one of the layers in a beam test experiment. Obvious improvement for the identification of $^{14}$O and $^{15}$O isotopes was achieved by using the range information.
△ Less
Submitted 19 November, 2015;
originally announced November 2015.
-
A large area plastic scintillation detector with 4-corner-readout
Authors:
Shu-wen Tang,
Yu-hong Yu,
Yong Zhou,
Zhi-yu Sun,
Xue-heng Zhang,
Shi-tao Wang,
Ke Yue,
Long-xiang Liu,
Fang Fang,
Duo Yan,
Yu Sun,
Zhao-min Wang
Abstract:
A 760 $\times$ 760 $\times$ 30 mm$^3$ plastic scintillation detector viewed by photomultiplier tubes (PMTs) from four corners has been developed, and the detector has been tested with cosmic rays and $γ$ rays. A position-independent effective time T$_{eff}$ has been found, indicating this detector can be used as a TOF detector. The hit position can also be reconstructed by the time from four corne…
▽ More
A 760 $\times$ 760 $\times$ 30 mm$^3$ plastic scintillation detector viewed by photomultiplier tubes (PMTs) from four corners has been developed, and the detector has been tested with cosmic rays and $γ$ rays. A position-independent effective time T$_{eff}$ has been found, indicating this detector can be used as a TOF detector. The hit position can also be reconstructed by the time from four corners. A TOF resolution of 236 ps and a position resolution of 48 mm have been achieved, and the detection efficiency has also been investigated.
△ Less
Submitted 25 August, 2015; v1 submitted 23 August, 2015;
originally announced August 2015.