-
SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World
Authors:
Chen Chen,
Zhirui Wang,
Taowei Sheng,
Yi Jiang,
Yundu Li,
Peirui Cheng,
Luning Zhang,
Kaiqiang Chen,
Yanfeng Hu,
Xue Yang,
Xian Sun
Abstract:
Existing vision-based 3D occupancy prediction methods are inherently limited in accuracy due to their exclusive reliance on street-view imagery, neglecting the potential benefits of incorporating satellite views. We propose SA-Occ, the first Satellite-Assisted 3D occupancy prediction model, which leverages GPS & IMU to integrate historical yet readily available satellite imagery into real-time app…
▽ More
Existing vision-based 3D occupancy prediction methods are inherently limited in accuracy due to their exclusive reliance on street-view imagery, neglecting the potential benefits of incorporating satellite views. We propose SA-Occ, the first Satellite-Assisted 3D occupancy prediction model, which leverages GPS & IMU to integrate historical yet readily available satellite imagery into real-time applications, effectively mitigating limitations of ego-vehicle perceptions, involving occlusions and degraded performance in distant regions. To address the core challenges of cross-view perception, we propose: 1) Dynamic-Decoupling Fusion, which resolves inconsistencies in dynamic regions caused by the temporal asynchrony between satellite and street views; 2) 3D-Proj Guidance, a module that enhances 3D feature extraction from inherently 2D satellite imagery; and 3) Uniform Sampling Alignment, which aligns the sampling density between street and satellite views. Evaluated on Occ3D-nuScenes, SA-Occ achieves state-of-the-art performance, especially among single-frame methods, with a 39.05% mIoU (a 6.97% improvement), while incurring only 6.93 ms of additional latency per frame. Our code and newly curated dataset are available at https://github.com/chenchen235/SA-Occ.
△ Less
Submitted 20 March, 2025;
originally announced March 2025.
-
Design of the Global Reconstruction Logic in the Belle II Level-1 Trigger system
Authors:
Y. -T. Lai,
T. Koga,
Y. Iwasaki,
Y. Ahn,
H. Bae,
M. Campajola,
B. G. Cheon,
H. -E. Cho,
T. Ferber,
I. Haide,
G. Heine,
C. -L. Hsu,
C. Kiesling,
C. -H. Kim,
J. B. Kim,
K. Kim,
S. H. Kim,
I. S. Lee,
M. J. Lee,
Y. P. Liao,
J. Lin,
A. Little,
H. K. Moon,
H. Nakazawa,
M. Neu
, et al. (10 additional authors not shown)
Abstract:
The Belle~II experiment is designed to search for physics beyond the Standard Model by investigating rare decays at the SuperKEKB \(e^{+}e^{-}\) collider. Owing to the significant beam background at high luminosity, the data acquisition system employs a hardware-based Level-1~Trigger to reduce the readout data throughput by selecting collision events of interest in real time. The Belle~II Level-1~…
▽ More
The Belle~II experiment is designed to search for physics beyond the Standard Model by investigating rare decays at the SuperKEKB \(e^{+}e^{-}\) collider. Owing to the significant beam background at high luminosity, the data acquisition system employs a hardware-based Level-1~Trigger to reduce the readout data throughput by selecting collision events of interest in real time. The Belle~II Level-1~Trigger system utilizes FPGAs to reconstruct various detector observables from the raw data for trigger decision-making. The Global Reconstruction Logic receives these processed observables from four sub-trigger systems and provides a global summary for the final trigger decision. Its logic encompasses charged particle tracking, matching between sub-triggers, and the identification of special event topologies associated with low-multiplicity decays. This article discusses the hardware devices, FPGA firmware, integration with peripheral systems, and the design and performance of the trigger algorithms implemented within the Global Reconstruction Logic.
△ Less
Submitted 3 March, 2025;
originally announced March 2025.
-
Manipulating the symmetry of photon-dressed electronic states
Authors:
Changhua Bao,
Michael Schüler,
Teng Xiao,
Fei Wang,
Haoyuan Zhong,
Tianyun Lin,
Xuanxi Cai,
Tianshuang Sheng,
Xiao Tang,
Hongyun Zhang,
Pu Yu,
Zhiyuan Sun,
Wenhui Duan,
Shuyun Zhou
Abstract:
Strong light-matter interaction provides opportunities for tailoring the physical properties of quantum materials on the ultrafast timescale by forming photon-dressed electronic states, i.e., Floquet-Bloch states. While the light field can in principle imprint its symmetry properties onto the photon-dressed electronic states, so far, how to experimentally detect and further engineer the symmetry o…
▽ More
Strong light-matter interaction provides opportunities for tailoring the physical properties of quantum materials on the ultrafast timescale by forming photon-dressed electronic states, i.e., Floquet-Bloch states. While the light field can in principle imprint its symmetry properties onto the photon-dressed electronic states, so far, how to experimentally detect and further engineer the symmetry of photon-dressed electronic states remains elusive. Here by utilizing time- and angle-resolved photoemission spectroscopy (TrARPES) with polarization-dependent study, we directly visualize the parity symmetry of Floquet-Bloch states in black phosphorus. The photon-dressed sideband exhibits opposite photoemission intensity to the valence band at the $Γ$ point,suggesting a switch of the parity induced by the light field. Moreover, a "hot spot" with strong intensity confined near $Γ$ is observed, indicating a momentum-dependent modulation beyond the parity switch. Combining with theoretical calculations, we reveal the light-induced engineering of the wave function of the Floquet-Bloch states as a result of the hybridization between the conduction and valence bands with opposite parities, and show that the "hot spot" is intrinsically dictated by the symmetry properties of black phosphorus. Our work suggests TrARPES as a direct probe for the parity of the photon-dressed electronic states with energy- and momentum-resolved information, providing an example for engineering the wave function and symmetry of such photon-dressed electronic states via Floquet engineering.
△ Less
Submitted 9 December, 2024;
originally announced December 2024.
-
Design fast Rydberg blockade SWAP gates with synthetic modulated driving
Authors:
Xin Wang,
Tianze Sheng,
Yuan Sun
Abstract:
The cold atom qubit platform emerges as an attractive choice for the next stage of quantum computation research, where a special family of synthetic analytical pulses has considerably improved the experimental performance of Controlled-PHASE Rydberg blockade gates in recent studies. The success of Controlled-PHASE Rydberg blockade gates triggers the intriguing question of whether the two-qubit Ryd…
▽ More
The cold atom qubit platform emerges as an attractive choice for the next stage of quantum computation research, where a special family of synthetic analytical pulses has considerably improved the experimental performance of Controlled-PHASE Rydberg blockade gates in recent studies. The success of Controlled-PHASE Rydberg blockade gates triggers the intriguing question of whether the two-qubit Rydberg blockade gate SWAP gate exists. Via investigating the transition linkage structure, we provide a definitive answer to this question and establish the method of fast SWAP Rydberg blockade gates with synthetic continuously-modulated driving. These gate protocols use careful analysis to properly generate coherent population transfer and phase accumulation of the wave function in the atom-laser interaction process. They can adapt to finite Rydberg blockade strengths and bear considerable resistance some major adverse effects such as laser fluctuations. Further examinations reveal that we can anticipate satisfying performances of the method with currently available experimental techniques in relevant research areas.
△ Less
Submitted 14 November, 2024;
originally announced November 2024.
-
Consistency Based Weakly Self-Supervised Learning for Human Activity Recognition with Wearables
Authors:
Taoran Sheng,
Manfred Huber
Abstract:
While the widely available embedded sensors in smartphones and other wearable devices make it easier to obtain data of human activities, recognizing different types of human activities from sensor-based data remains a difficult research topic in ubiquitous computing. One reason for this is that most of the collected data is unlabeled. However, many current human activity recognition (HAR) systems…
▽ More
While the widely available embedded sensors in smartphones and other wearable devices make it easier to obtain data of human activities, recognizing different types of human activities from sensor-based data remains a difficult research topic in ubiquitous computing. One reason for this is that most of the collected data is unlabeled. However, many current human activity recognition (HAR) systems are based on supervised methods, which heavily rely on the labels of the data. We describe a weakly self-supervised approach in this paper that consists of two stages: (1) In stage one, the model learns from the nature of human activities by projecting the data into an embedding space where similar activities are grouped together; (2) In stage two, the model is fine-tuned using similarity information in a few-shot learning fashion using the similarity information of the data. This allows downstream classification or clustering tasks to benefit from the embeddings. Experiments on three benchmark datasets demonstrate the framework's effectiveness and show that our approach can help the clustering algorithm achieve comparable performance in identifying and categorizing the underlying human activities as pure supervised techniques applied directly to a corresponding fully labeled data set.
△ Less
Submitted 29 July, 2024;
originally announced August 2024.
-
FTF-ER: Feature-Topology Fusion-Based Experience Replay Method for Continual Graph Learning
Authors:
Jinhui Pang,
Changqing Lin,
Xiaoshuai Hao,
Rong Yin,
Zixuan Wang,
Zhihui Zhang,
Jinglin He,
Huang Tai Sheng
Abstract:
Continual graph learning (CGL) is an important and challenging task that aims to extend static GNNs to dynamic task flow scenarios. As one of the mainstream CGL methods, the experience replay (ER) method receives widespread attention due to its superior performance. However, existing ER methods focus on identifying samples by feature significance or topological relevance, which limits their utiliz…
▽ More
Continual graph learning (CGL) is an important and challenging task that aims to extend static GNNs to dynamic task flow scenarios. As one of the mainstream CGL methods, the experience replay (ER) method receives widespread attention due to its superior performance. However, existing ER methods focus on identifying samples by feature significance or topological relevance, which limits their utilization of comprehensive graph data. In addition, the topology-based ER methods only consider local topological information and add neighboring nodes to the buffer, which ignores the global topological information and increases memory overhead. To bridge these gaps, we propose a novel method called Feature-Topology Fusion-based Experience Replay (FTF-ER) to effectively mitigate the catastrophic forgetting issue with enhanced efficiency. Specifically, from an overall perspective to maximize the utilization of the entire graph data, we propose a highly complementary approach including both feature and global topological information, which can significantly improve the effectiveness of the sampled nodes. Moreover, to further utilize global topological information, we propose Hodge Potential Score (HPS) as a novel module to calculate the topological importance of nodes. HPS derives a global node ranking via Hodge decomposition on graphs, providing more accurate global topological information compared to neighbor sampling. By excluding neighbor sampling, HPS significantly reduces buffer storage costs for acquiring topological information and simultaneously decreases training time. Compared with state-of-the-art methods, FTF-ER achieves a significant improvement of 3.6% in AA and 7.1% in AF on the OGB-Arxiv dataset, demonstrating its superior performance in the class-incremental learning setting.
△ Less
Submitted 8 August, 2024; v1 submitted 28 July, 2024;
originally announced July 2024.
-
Weakly-Supervised Detection of Bone Lesions in CT
Authors:
Tao Sheng,
Tejas Sudharshan Mathai,
Alexander Shieh,
Ronald M. Summers
Abstract:
The skeletal region is one of the common sites of metastatic spread of cancer in the breast and prostate. CT is routinely used to measure the size of lesions in the bones. However, they can be difficult to spot due to the wide variations in their sizes, shapes, and appearances. Precise localization of such lesions would enable reliable tracking of interval changes (growth, shrinkage, or unchanged…
▽ More
The skeletal region is one of the common sites of metastatic spread of cancer in the breast and prostate. CT is routinely used to measure the size of lesions in the bones. However, they can be difficult to spot due to the wide variations in their sizes, shapes, and appearances. Precise localization of such lesions would enable reliable tracking of interval changes (growth, shrinkage, or unchanged status). To that end, an automated technique to detect bone lesions is highly desirable. In this pilot work, we developed a pipeline to detect bone lesions (lytic, blastic, and mixed) in CT volumes via a proxy segmentation task. First, we used the bone lesions that were prospectively marked by radiologists in a few 2D slices of CT volumes and converted them into weak 3D segmentation masks. Then, we trained a 3D full-resolution nnUNet model using these weak 3D annotations to segment the lesions and thereby detected them. Our automated method detected bone lesions in CT with a precision of 96.7% and recall of 47.3% despite the use of incomplete and partial training data. To the best of our knowledge, we are the first to attempt the direct detection of bone lesions in CT via a proxy segmentation task.
△ Less
Submitted 31 January, 2024;
originally announced February 2024.
-
Long-range near-side correlation in $e^+e^-$ Collisions at 183-209 GeV with ALEPH Archived Data
Authors:
Yu-Chen Chen,
Yi Chen,
Anthony Badea,
Austin Baty,
Gian Michele Innocenti,
Marcello Maggi,
Christopher McGinn,
Michael Peters,
Tzu-An Sheng,
Jesse Thaler,
Yen-Jie Lee
Abstract:
The first measurement of two-particle angular correlations for charged particles with LEP-II data is presented. The study is performed using archived hadronic $e^+e^-$ data collected by ALEPH at center-of-mass energies up to 209 GeV, above the $W^+W^-$ production threshold, which provide access to unprecedented charged-particle multiplicities and more complex color-string configurations if compare…
▽ More
The first measurement of two-particle angular correlations for charged particles with LEP-II data is presented. The study is performed using archived hadronic $e^+e^-$ data collected by ALEPH at center-of-mass energies up to 209 GeV, above the $W^+W^-$ production threshold, which provide access to unprecedented charged-particle multiplicities and more complex color-string configurations if compared to previous measurements at LEP-I energies. An intriguing long-range near-side excess is observed in the correlation function measured with respect to the thrust axis in the highest multiplicity interval $N_{\mathrm{trk}}\geq 50$. Such a structure is not predicted by the Monte-Carlo simulation. The harmonic anisotropy coefficients $v_n$, which result from the Fourier expansion of the two-particle correlation functions, were also measured for the first time in $e^+e^-$ data, and compared to PYTHIA6 predictions and to the results obtained in proton-proton collisions. The results presented in the Letter provide novel experimental constraints on the formation of collective phenomena in point-like $e^+e^-$ collisions.
△ Less
Submitted 14 August, 2024; v1 submitted 8 December, 2023;
originally announced December 2023.
-
Analysis note: two-particle correlation in $e^+e^-$ collisions at 91-209 GeV with archived ALEPH data
Authors:
Yu-Chen Chen,
Yen-Jie Lee,
Yi Chen,
Paoti Chang,
Chris McGinn,
Tzu-An Sheng,
Gian Michele Innocenti,
Marcello Maggi
Abstract:
The first measurement of two-particle angular correlations for charged particles produced in $e^+e^-$ annihilation up to $\sqrt{s}$ = 209 GeV is presented. Hadronic $e^+e^-$ data, archived at center-of-mass energies ranging from 91 to 209 GeV, were collected using the ALEPH detector at LEP between 1992 and 2000. The angular correlation functions have been measured across a wide range of pseudorapi…
▽ More
The first measurement of two-particle angular correlations for charged particles produced in $e^+e^-$ annihilation up to $\sqrt{s}$ = 209 GeV is presented. Hadronic $e^+e^-$ data, archived at center-of-mass energies ranging from 91 to 209 GeV, were collected using the ALEPH detector at LEP between 1992 and 2000. The angular correlation functions have been measured across a wide range of pseudorapidities and the full azimuth in bins of charged particle multiplicity. This is the first such measurement using LEP-II data. With LEP-II data at 91 GeV, neither the beam coordinate analysis nor the thrust coordinate analysis reveals significant long-range correlations, consistent with the finding in the previous measurement with the LEP-I sample. Results for $e^+e^-$ data at energies above 91 GeV, which allow for higher event multiplicities reaching approximately 50, are presented for the first time. A long-range near-side excess in the correlation function has been identified in the thrust axis analysis. Moreover, the two-particle correlation functions were decomposed using a Fourier series, and the resulting Fourier coefficients $v_n$ were compared with event generator outputs. In events with high multiplicity, featuring more than 50 particles, the extracted $v_2$ and $v_3$ magnitudes from the data are higher than those from the Monte Carlo reference.
△ Less
Submitted 26 January, 2024; v1 submitted 18 September, 2023;
originally announced September 2023.
-
Weakly Supervised Multi-Task Representation Learning for Human Activity Analysis Using Wearables
Authors:
Taoran Sheng,
Manfred Huber
Abstract:
Sensor data streams from wearable devices and smart environments are widely studied in areas like human activity recognition (HAR), person identification, or health monitoring. However, most of the previous works in activity and sensor stream analysis have been focusing on one aspect of the data, e.g. only recognizing the type of the activity or only identifying the person who performed the activi…
▽ More
Sensor data streams from wearable devices and smart environments are widely studied in areas like human activity recognition (HAR), person identification, or health monitoring. However, most of the previous works in activity and sensor stream analysis have been focusing on one aspect of the data, e.g. only recognizing the type of the activity or only identifying the person who performed the activity. We instead propose an approach that uses a weakly supervised multi-output siamese network that learns to map the data into multiple representation spaces, where each representation space focuses on one aspect of the data. The representation vectors of the data samples are positioned in the space such that the data with the same semantic meaning in that aspect are closely located to each other. Therefore, as demonstrated with a set of experiments, the trained model can provide metrics for clustering data based on multiple aspects, allowing it to address multiple tasks simultaneously and even to outperform single task supervised methods in many situations. In addition, further experiments are presented that in more detail analyze the effect of the architecture and of using multiple tasks within this framework, that investigate the scalability of the model to include additional tasks, and that demonstrate the ability of the framework to combine data for which only partial relationship information with respect to the target tasks is available.
△ Less
Submitted 6 August, 2023;
originally announced August 2023.
-
Unsupervised Embedding Learning for Human Activity Recognition Using Wearable Sensor Data
Authors:
Taoran Sheng,
Manfred Huber
Abstract:
The embedded sensors in widely used smartphones and other wearable devices make the data of human activities more accessible. However, recognizing different human activities from the wearable sensor data remains a challenging research problem in ubiquitous computing. One of the reasons is that the majority of the acquired data has no labels. In this paper, we present an unsupervised approach, whic…
▽ More
The embedded sensors in widely used smartphones and other wearable devices make the data of human activities more accessible. However, recognizing different human activities from the wearable sensor data remains a challenging research problem in ubiquitous computing. One of the reasons is that the majority of the acquired data has no labels. In this paper, we present an unsupervised approach, which is based on the nature of human activity, to project the human activities into an embedding space in which similar activities will be located closely together. Using this, subsequent clustering algorithms can benefit from the embeddings, forming behavior clusters that represent the distinct activities performed by a person. Results of experiments on three labeled benchmark datasets demonstrate the effectiveness of the framework and show that our approach can help the clustering algorithm achieve improved performance in identifying and categorizing the underlying human activities compared to unsupervised techniques applied directly to the original data set.
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
Siamese Networks for Weakly Supervised Human Activity Recognition
Authors:
Taoran Sheng,
Manfred Huber
Abstract:
Deep learning has been successfully applied to human activity recognition. However, training deep neural networks requires explicitly labeled data which is difficult to acquire. In this paper, we present a model with multiple siamese networks that are trained by using only the information about the similarity between pairs of data samples without knowing the explicit labels. The trained model maps…
▽ More
Deep learning has been successfully applied to human activity recognition. However, training deep neural networks requires explicitly labeled data which is difficult to acquire. In this paper, we present a model with multiple siamese networks that are trained by using only the information about the similarity between pairs of data samples without knowing the explicit labels. The trained model maps the activity data samples into fixed size representation vectors such that the distance between the vectors in the representation space approximates the similarity of the data samples in the input space. Thus, the trained model can work as a metric for a wide range of different clustering algorithms. The training process minimizes a similarity loss function that forces the distance metric to be small for pairs of samples from the same kind of activity, and large for pairs of samples from different kinds of activities. We evaluate the model on three datasets to verify its effectiveness in segmentation and recognition of continuous human activity sequences.
△ Less
Submitted 17 July, 2023;
originally announced July 2023.
-
Chat-REC: Towards Interactive and Explainable LLMs-Augmented Recommender System
Authors:
Yunfan Gao,
Tao Sheng,
Youlin Xiang,
Yun Xiong,
Haofen Wang,
Jiawei Zhang
Abstract:
Large language models (LLMs) have demonstrated their significant potential to be applied for addressing various application tasks. However, traditional recommender systems continue to face great challenges such as poor interactivity and explainability, which actually also hinder their broad deployment in real-world systems. To address these limitations, this paper proposes a novel paradigm called…
▽ More
Large language models (LLMs) have demonstrated their significant potential to be applied for addressing various application tasks. However, traditional recommender systems continue to face great challenges such as poor interactivity and explainability, which actually also hinder their broad deployment in real-world systems. To address these limitations, this paper proposes a novel paradigm called Chat-Rec (ChatGPT Augmented Recommender System) that innovatively augments LLMs for building conversational recommender systems by converting user profiles and historical interactions into prompts. Chat-Rec is demonstrated to be effective in learning user preferences and establishing connections between users and products through in-context learning, which also makes the recommendation process more interactive and explainable. What's more, within the Chat-Rec framework, user's preferences can transfer to different products for cross-domain recommendations, and prompt-based injection of information into LLMs can also handle the cold-start scenarios with new items. In our experiments, Chat-Rec effectively improve the results of top-k recommendations and performs better in zero-shot rating prediction task. Chat-Rec offers a novel approach to improving recommender systems and presents new practical scenarios for the implementation of AIGC (AI generated content) in recommender system studies.
△ Less
Submitted 3 April, 2023; v1 submitted 25 March, 2023;
originally announced March 2023.
-
The Present and Future of QCD
Authors:
P. Achenbach,
D. Adhikari,
A. Afanasev,
F. Afzal,
C. A. Aidala,
A. Al-bataineh,
D. K. Almaalol,
M. Amaryan,
D. Androić,
W. R. Armstrong,
M. Arratia,
J. Arrington,
A. Asaturyan,
E. C. Aschenauer,
H. Atac,
H. Avakian,
T. Averett,
C. Ayerbe Gayoso,
X. Bai,
K. N. Barish,
N. Barnea,
G. Basar,
M. Battaglieri,
A. A. Baty,
I. Bautista
, et al. (378 additional authors not shown)
Abstract:
This White Paper presents the community inputs and scientific conclusions from the Hot and Cold QCD Town Meeting that took place September 23-25, 2022 at MIT, as part of the Nuclear Science Advisory Committee (NSAC) 2023 Long Range Planning process. A total of 424 physicists registered for the meeting. The meeting highlighted progress in Quantum Chromodynamics (QCD) nuclear physics since the 2015…
▽ More
This White Paper presents the community inputs and scientific conclusions from the Hot and Cold QCD Town Meeting that took place September 23-25, 2022 at MIT, as part of the Nuclear Science Advisory Committee (NSAC) 2023 Long Range Planning process. A total of 424 physicists registered for the meeting. The meeting highlighted progress in Quantum Chromodynamics (QCD) nuclear physics since the 2015 LRP (LRP15) and identified key questions and plausible paths to obtaining answers to those questions, defining priorities for our research over the coming decade. In defining the priority of outstanding physics opportunities for the future, both prospects for the short (~ 5 years) and longer term (5-10 years and beyond) are identified together with the facilities, personnel and other resources needed to maximize the discovery potential and maintain United States leadership in QCD physics worldwide. This White Paper is organized as follows: In the Executive Summary, we detail the Recommendations and Initiatives that were presented and discussed at the Town Meeting, and their supporting rationales. Section 2 highlights major progress and accomplishments of the past seven years. It is followed, in Section 3, by an overview of the physics opportunities for the immediate future, and in relation with the next QCD frontier: the EIC. Section 4 provides an overview of the physics motivations and goals associated with the EIC. Section 5 is devoted to the workforce development and support of diversity, equity and inclusion. This is followed by a dedicated section on computing in Section 6. Section 7 describes the national need for nuclear data science and the relevance to QCD research.
△ Less
Submitted 4 March, 2023;
originally announced March 2023.
-
Modeling Global Distribution for Federated Learning with Label Distribution Skew
Authors:
Tao Sheng,
Chengchao Shen,
Yuan Liu,
Yeyu Ou,
Zhe Qu,
Jianxin Wang
Abstract:
Federated learning achieves joint training of deep models by connecting decentralized data sources, which can significantly mitigate the risk of privacy leakage. However, in a more general case, the distributions of labels among clients are different, called ``label distribution skew''. Directly applying conventional federated learning without consideration of label distribution skew issue signifi…
▽ More
Federated learning achieves joint training of deep models by connecting decentralized data sources, which can significantly mitigate the risk of privacy leakage. However, in a more general case, the distributions of labels among clients are different, called ``label distribution skew''. Directly applying conventional federated learning without consideration of label distribution skew issue significantly hurts the performance of the global model. To this end, we propose a novel federated learning method, named FedMGD, to alleviate the performance degradation caused by the label distribution skew issue. It introduces a global Generative Adversarial Network to model the global data distribution without access to local datasets, so the global model can be trained using the global information of data distribution without privacy leakage. The experimental results demonstrate that our proposed method significantly outperforms the state-of-the-art on several public benchmarks. Code is available at \url{https://github.com/Sheng-T/FedMGD}.
△ Less
Submitted 17 December, 2022;
originally announced December 2022.
-
First measurement of anti-k$_\mathrm{T}$ jet spectra and jet substructure using the archived ALEPH $e^+e^-$ data at 91.2 GeV
Authors:
Yi Chen,
Austin Baty,
Dennis Perepelitsa,
Christopher McGinn,
Jesse Thaler,
Marcello Maggi,
Paoti Chang,
Tzu-An Sheng,
Yang-Ting Chien,
Yen-Jie Lee
Abstract:
We present the first anti-k$_{T}$ jet spectrum and substructure measurements using the archived ALEPH $e^+e^-$ data taken in 1994 at a center of mass energy of $\sqrt{s} = 91.2$ GeV. Jets are reconstructed with the anti-k$_{T}$ algorithm with a resolution parameter of 0.4. It is the cleanest test of jets and QCD without the complication of hadronic initial states. The fixed center-of-mass energy a…
▽ More
We present the first anti-k$_{T}$ jet spectrum and substructure measurements using the archived ALEPH $e^+e^-$ data taken in 1994 at a center of mass energy of $\sqrt{s} = 91.2$ GeV. Jets are reconstructed with the anti-k$_{T}$ algorithm with a resolution parameter of 0.4. It is the cleanest test of jets and QCD without the complication of hadronic initial states. The fixed center-of-mass energy also allows the first direct test of pQCD calculation. We present both the inclusive jet energy spectrum and the leading dijet energy spectra, together with a number of substructure observables. They are compared to predictions from PYTHIA6, PYTHIA8, Sherpa, HERWIG, VINCIA, and PYQUEN. None of the models fully reproduce the data. The data are also compared to two perturbative QCD calculations at NLO and with NLL'+R resummation. The results can also serve as reference measurements to compare to results from hadronic colliders. Future directions, including testing jet clustering algorithms designed for future electron-ion collider experiments, will also be discussed.
△ Less
Submitted 24 November, 2022;
originally announced November 2022.
-
ReCo: A Dataset for Residential Community Layout Planning
Authors:
Xi Chen,
Yun Xiong,
Siqi Wang,
Haofen Wang,
Tao Sheng,
Yao Zhang,
Yu Ye
Abstract:
Layout planning is centrally important in the field of architecture and urban design. Among the various basic units carrying urban functions, residential community plays a vital part for supporting human life. Therefore, the layout planning of residential community has always been of concern, and has attracted particular attention since the advent of deep learning that facilitates the automated la…
▽ More
Layout planning is centrally important in the field of architecture and urban design. Among the various basic units carrying urban functions, residential community plays a vital part for supporting human life. Therefore, the layout planning of residential community has always been of concern, and has attracted particular attention since the advent of deep learning that facilitates the automated layout generation and spatial pattern recognition. However, the research circles generally suffer from the insufficiency of residential community layout benchmark or high-quality datasets, which hampers the future exploration of data-driven methods for residential community layout planning. The lack of datasets is largely due to the difficulties of large-scale real-world residential data acquisition and long-term expert screening. In order to address the issues and advance a benchmark dataset for various intelligent spatial design and analysis applications in the development of smart city, we introduce Residential Community Layout Planning (ReCo) Dataset, which is the first and largest open-source vector dataset related to real-world community to date. ReCo Dataset is presented in multiple data formats with 37,646 residential community layout plans, covering 598,728 residential buildings with height information. ReCo can be conveniently adapted for residential community layout related urban design tasks, e.g., generative layout design, morphological pattern recognition and spatial evaluation. To validate the utility of ReCo in automated residential community layout planning, two Generative Adversarial Network (GAN) based generative models are further applied to the dataset. We expect ReCo Dataset to inspire more creative and practical work in intelligent design and beyond. The ReCo Dataset is published at: https://www.kaggle.com/fdudsde/reco-dataset.
△ Less
Submitted 27 August, 2023; v1 submitted 8 June, 2022;
originally announced June 2022.
-
Jet energy spectrum and substructure in $e^+e^-$ collisions at 91.2 GeV with ALEPH Archived Data
Authors:
Yi Chen,
Anthony Badea,
Austin Baty,
Paoti Chang,
Yang-Ting Chien,
Gian Michele Innocenti,
Marcello Maggi,
Christopher McGinn,
Dennis V. Perepelitsa,
Michael Peters,
Tzu-An Sheng,
Jesse Thaler,
Yen-Jie Lee
Abstract:
The first measurements of energy spectra and substructure of anti-$k_{T}$ jets in hadronic $Z^0$ decays in $e^+e^-$ collisions are presented. The archived $e^+e^-$ annihilation data at a center-of-mass energy of 91.2 GeV were collected with the ALEPH detector at LEP in 1994. In addition to inclusive jet and leading dijet energy spectra, various jet substructure observables are analyzed as a functi…
▽ More
The first measurements of energy spectra and substructure of anti-$k_{T}$ jets in hadronic $Z^0$ decays in $e^+e^-$ collisions are presented. The archived $e^+e^-$ annihilation data at a center-of-mass energy of 91.2 GeV were collected with the ALEPH detector at LEP in 1994. In addition to inclusive jet and leading dijet energy spectra, various jet substructure observables are analyzed as a function of jet energy which includes groomed and ungroomed jet mass to jet energy ratios, groomed momentum sharing, and groomed jet radius. The results are compared with perturbative QCD calculations and predictions from the SHERPA, HERWIG v7.1.5, PYTHIA 6, PYTHIA 8, and PYQUEN event generators. The jet energy spectra agree with perturbative QCD calculations which include the treatment of logarithms of the jet radius and threshold logarithms. None of the event generators give a fully satisfactory description of the data.
△ Less
Submitted 5 April, 2022; v1 submitted 18 November, 2021;
originally announced November 2021.
-
Bidirectional Regression for Arbitrary-Shaped Text Detection
Authors:
Tao Sheng,
Zhouhui Lian
Abstract:
Arbitrary-shaped text detection has recently attracted increasing interests and witnessed rapid development with the popularity of deep learning algorithms. Nevertheless, existing approaches often obtain inaccurate detection results, mainly due to the relatively weak ability to utilize context information and the inappropriate choice of offset references. This paper presents a novel text instance…
▽ More
Arbitrary-shaped text detection has recently attracted increasing interests and witnessed rapid development with the popularity of deep learning algorithms. Nevertheless, existing approaches often obtain inaccurate detection results, mainly due to the relatively weak ability to utilize context information and the inappropriate choice of offset references. This paper presents a novel text instance expression which integrates both foreground and background information into the pipeline, and naturally uses the pixels near text boundaries as the offset starts. Besides, a corresponding post-processing algorithm is also designed to sequentially combine the four prediction results and reconstruct the text instance accurately. We evaluate our method on several challenging scene text benchmarks, including both curved and multi-oriented text datasets. Experimental results demonstrate that the proposed approach obtains superior or competitive performance compared to other state-of-the-art methods, e.g., 83.4% F-score for Total-Text, 82.4% F-score for MSRA-TD500, etc.
△ Less
Submitted 13 July, 2021;
originally announced July 2021.
-
CentripetalText: An Efficient Text Instance Representation for Scene Text Detection
Authors:
Tao Sheng,
Jie Chen,
Zhouhui Lian
Abstract:
Scene text detection remains a grand challenge due to the variation in text curvatures, orientations, and aspect ratios. One of the hardest problems in this task is how to represent text instances of arbitrary shapes. Although many methods have been proposed to model irregular texts in a flexible manner, most of them lose simplicity and robustness. Their complicated post-processings and the regres…
▽ More
Scene text detection remains a grand challenge due to the variation in text curvatures, orientations, and aspect ratios. One of the hardest problems in this task is how to represent text instances of arbitrary shapes. Although many methods have been proposed to model irregular texts in a flexible manner, most of them lose simplicity and robustness. Their complicated post-processings and the regression under Dirac delta distribution undermine the detection performance and the generalization ability. In this paper, we propose an efficient text instance representation named CentripetalText (CT), which decomposes text instances into the combination of text kernels and centripetal shifts. Specifically, we utilize the centripetal shifts to implement pixel aggregation, guiding the external text pixels to the internal text kernels. The relaxation operation is integrated into the dense regression for centripetal shifts, allowing the correct prediction in a range instead of a specific value. The convenient reconstruction of text contours and the tolerance of prediction errors in our method guarantee the high detection accuracy and the fast inference speed, respectively. Besides, we shrink our text detector into a proposal generation module, namely CentripetalText Proposal Network, replacing Segmentation Proposal Network in Mask TextSpotter v3 and producing more accurate proposals. To validate the effectiveness of our method, we conduct experiments on several commonly used scene text benchmarks, including both curved and multi-oriented text datasets. For the task of scene text detection, our approach achieves superior or competitive performance compared to other existing methods, e.g., F-measure of 86.3% at 40.0 FPS on Total-Text, F-measure of 86.1% at 34.8 FPS on MSRA-TD500, etc. For the task of end-to-end scene text recognition, our method outperforms Mask TextSpotter v3 by 1.1% on Total-Text.
△ Less
Submitted 15 January, 2022; v1 submitted 13 July, 2021;
originally announced July 2021.
-
On Distance and Kernel Measures of Conditional Independence
Authors:
Tianhong Sheng,
Bharath K. Sriperumbudur
Abstract:
Measuring conditional independence is one of the important tasks in statistical inference and is fundamental in causal discovery, feature selection, dimensionality reduction, Bayesian network learning, and others. In this work, we explore the connection between conditional independence measures induced by distances on a metric space and reproducing kernels associated with a reproducing kernel Hilb…
▽ More
Measuring conditional independence is one of the important tasks in statistical inference and is fundamental in causal discovery, feature selection, dimensionality reduction, Bayesian network learning, and others. In this work, we explore the connection between conditional independence measures induced by distances on a metric space and reproducing kernels associated with a reproducing kernel Hilbert space (RKHS). For certain distance and kernel pairs, we show the distance-based conditional independence measures to be equivalent to that of kernel-based measures. On the other hand, we also show that some popular---in machine learning---kernel conditional independence measures based on the Hilbert-Schmidt norm of a certain cross-conditional covariance operator, do not have a simple distance representation, except in some limiting cases. This paper, therefore, shows the distance and kernel measures of conditional independence to be not quite equivalent unlike in the case of joint independence as shown by Sejdinovic et al. (2013).
△ Less
Submitted 17 August, 2020; v1 submitted 2 December, 2019;
originally announced December 2019.
-
Measurements of two-particle correlations in $e^+e^-$ collisions at 91 GeV with ALEPH archived data
Authors:
Anthony Badea,
Austin Baty,
Paoti Chang,
Gian Michele Innocenti,
Marcello Maggi,
Christopher McGinn,
Michael Peters,
Tzu-An Sheng,
Jesse Thaler,
Yen-Jie Lee
Abstract:
Measurements of two-particle angular correlations of charged particles emitted in hadronic $Z$ decays are presented. The archived $e^+e^-$ annihilation data at a center-of-mass energy of 91 GeV were collected with the ALEPH detector at LEP between 1992 and 1995. The correlation functions are measured over a broad range of pseudorapidity and full azimuth as a function of charged particle multiplici…
▽ More
Measurements of two-particle angular correlations of charged particles emitted in hadronic $Z$ decays are presented. The archived $e^+e^-$ annihilation data at a center-of-mass energy of 91 GeV were collected with the ALEPH detector at LEP between 1992 and 1995. The correlation functions are measured over a broad range of pseudorapidity and full azimuth as a function of charged particle multiplicity. No significant long-range correlation is observed in either the lab coordinate analysis or the thrust coordinate analysis, where the latter is sensitive to a medium expanding transverse to the color string between the outgoing $q\bar{q}$ pair from $Z$ boson decays. The associated yield distributions in both analyses are in better agreement with the prediction from the PYTHIA v6.1 event generator than from HERWIG v7.1.5. They provide new insights to showering and hadronization modeling. These results serve as an important reference to the observed long-range correlation in proton-proton, proton-nucleus, and nucleus-nucleus collisions.
△ Less
Submitted 26 November, 2019; v1 submitted 2 June, 2019;
originally announced June 2019.
-
Low-Power Computer Vision: Status, Challenges, Opportunities
Authors:
Sergei Alyamkin,
Matthew Ardi,
Alexander C. Berg,
Achille Brighton,
Bo Chen,
Yiran Chen,
Hsin-Pai Cheng,
Zichen Fan,
Chen Feng,
Bo Fu,
Kent Gauen,
Abhinav Goel,
Alexander Goncharenko,
Xuyang Guo,
Soonhoi Ha,
Andrew Howard,
Xiao Hu,
Yuanjun Huang,
Donghyun Kang,
Jaeyoun Kim,
Jong Gook Ko,
Alexander Kondratyev,
Junhyeok Lee,
Seungjae Lee,
Suwoong Lee
, et al. (19 additional authors not shown)
Abstract:
Computer vision has achieved impressive progress in recent years. Meanwhile, mobile phones have become the primary computing platforms for millions of people. In addition to mobile phones, many autonomous systems rely on visual data for making decisions and some of these systems have limited energy (such as unmanned aerial vehicles also called drones and mobile robots). These systems rely on batte…
▽ More
Computer vision has achieved impressive progress in recent years. Meanwhile, mobile phones have become the primary computing platforms for millions of people. In addition to mobile phones, many autonomous systems rely on visual data for making decisions and some of these systems have limited energy (such as unmanned aerial vehicles also called drones and mobile robots). These systems rely on batteries and energy efficiency is critical. This article serves two main purposes: (1) Examine the state-of-the-art for low-power solutions to detect objects in images. Since 2015, the IEEE Annual International Low-Power Image Recognition Challenge (LPIRC) has been held to identify the most energy-efficient computer vision solutions. This article summarizes 2018 winners' solutions. (2) Suggest directions for research as well as opportunities for low-power computer vision.
△ Less
Submitted 15 April, 2019;
originally announced April 2019.
-
Low Power Inference for On-Device Visual Recognition with a Quantization-Friendly Solution
Authors:
Chen Feng,
Tao Sheng,
Zhiyu Liang,
Shaojie Zhuo,
Xiaopeng Zhang,
Liang Shen,
Matthew Ardi,
Alexander C. Berg,
Yiran Chen,
Bo Chen,
Kent Gauen,
Yung-Hsiang Lu
Abstract:
The IEEE Low-Power Image Recognition Challenge (LPIRC) is an annual competition started in 2015 that encourages joint hardware and software solutions for computer vision systems with low latency and power. Track 1 of the competition in 2018 focused on the innovation of software solutions with fixed inference engine and hardware. This decision allows participants to submit models online and not wor…
▽ More
The IEEE Low-Power Image Recognition Challenge (LPIRC) is an annual competition started in 2015 that encourages joint hardware and software solutions for computer vision systems with low latency and power. Track 1 of the competition in 2018 focused on the innovation of software solutions with fixed inference engine and hardware. This decision allows participants to submit models online and not worry about building and bringing custom hardware on-site, which attracted a historically large number of submissions. Among the diverse solutions, the winning solution proposed a quantization-friendly framework for MobileNets that achieves an accuracy of 72.67% on the holdout dataset with an average latency of 27ms on a single CPU core of Google Pixel2 phone, which is superior to the best real-time MobileNet models at the time.
△ Less
Submitted 12 March, 2019;
originally announced March 2019.
-
M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network
Authors:
Qijie Zhao,
Tao Sheng,
Yongtao Wang,
Zhi Tang,
Ying Chen,
Ling Cai,
Haibin Ling
Abstract:
Feature pyramids are widely exploited by both the state-of-the-art one-stage object detectors (e.g., DSSD, RetinaNet, RefineDet) and the two-stage object detectors (e.g., Mask R-CNN, DetNet) to alleviate the problem arising from scale variation across object instances. Although these object detectors with feature pyramids achieve encouraging results, they have some limitations due to that they onl…
▽ More
Feature pyramids are widely exploited by both the state-of-the-art one-stage object detectors (e.g., DSSD, RetinaNet, RefineDet) and the two-stage object detectors (e.g., Mask R-CNN, DetNet) to alleviate the problem arising from scale variation across object instances. Although these object detectors with feature pyramids achieve encouraging results, they have some limitations due to that they only simply construct the feature pyramid according to the inherent multi-scale, pyramidal architecture of the backbones which are actually designed for object classification task. Newly, in this work, we present a method called Multi-Level Feature Pyramid Network (MLFPN) to construct more effective feature pyramids for detecting objects of different scales. First, we fuse multi-level features (i.e. multiple layers) extracted by backbone as the base feature. Second, we feed the base feature into a block of alternating joint Thinned U-shape Modules and Feature Fusion Modules and exploit the decoder layers of each u-shape module as the features for detecting objects. Finally, we gather up the decoder layers with equivalent scales (sizes) to develop a feature pyramid for object detection, in which every feature map consists of the layers (features) from multiple levels. To evaluate the effectiveness of the proposed MLFPN, we design and train a powerful end-to-end one-stage object detector we call M2Det by integrating it into the architecture of SSD, which gets better detection performance than state-of-the-art one-stage detectors. Specifically, on MS-COCO benchmark, M2Det achieves AP of 41.0 at speed of 11.8 FPS with single-scale inference strategy and AP of 44.2 with multi-scale inference strategy, which is the new state-of-the-art results among one-stage detectors. The code will be made available on \url{https://github.com/qijiezhao/M2Det.
△ Less
Submitted 6 January, 2019; v1 submitted 11 November, 2018;
originally announced November 2018.
-
2018 Low-Power Image Recognition Challenge
Authors:
Sergei Alyamkin,
Matthew Ardi,
Achille Brighton,
Alexander C. Berg,
Yiran Chen,
Hsin-Pai Cheng,
Bo Chen,
Zichen Fan,
Chen Feng,
Bo Fu,
Kent Gauen,
Jongkook Go,
Alexander Goncharenko,
Xuyang Guo,
Hong Hanh Nguyen,
Andrew Howard,
Yuanjun Huang,
Donghyun Kang,
Jaeyoun Kim,
Alexander Kondratyev,
Seungjae Lee,
Suwoong Lee,
Junhyeok Lee,
Zhiyu Liang,
Xin Liu
, et al. (16 additional authors not shown)
Abstract:
The Low-Power Image Recognition Challenge (LPIRC, https://rebootingcomputing.ieee.org/lpirc) is an annual competition started in 2015. The competition identifies the best technologies that can classify and detect objects in images efficiently (short execution time and low energy consumption) and accurately (high precision). Over the four years, the winners' scores have improved more than 24 times.…
▽ More
The Low-Power Image Recognition Challenge (LPIRC, https://rebootingcomputing.ieee.org/lpirc) is an annual competition started in 2015. The competition identifies the best technologies that can classify and detect objects in images efficiently (short execution time and low energy consumption) and accurately (high precision). Over the four years, the winners' scores have improved more than 24 times. As computer vision is widely used in many battery-powered systems (such as drones and mobile phones), the need for low-power computer vision will become increasingly important. This paper summarizes LPIRC 2018 by describing the three different tracks and the winners' solutions.
△ Less
Submitted 3 October, 2018;
originally announced October 2018.
-
CFENet: An Accurate and Efficient Single-Shot Object Detector for Autonomous Driving
Authors:
Qijie Zhao,
Tao Sheng,
Yongtao Wang,
Feng Ni,
Ling Cai
Abstract:
The ability to detect small objects and the speed of the object detector are very important for the application of autonomous driving, and in this paper, we propose an effective yet efficient one-stage detector, which gained the second place in the Road Object Detection competition of CVPR2018 workshop - Workshop of Autonomous Driving(WAD). The proposed detector inherits the architecture of SSD an…
▽ More
The ability to detect small objects and the speed of the object detector are very important for the application of autonomous driving, and in this paper, we propose an effective yet efficient one-stage detector, which gained the second place in the Road Object Detection competition of CVPR2018 workshop - Workshop of Autonomous Driving(WAD). The proposed detector inherits the architecture of SSD and introduces a novel Comprehensive Feature Enhancement(CFE) module into it. Experimental results on this competition dataset as well as the MSCOCO dataset demonstrate that the proposed detector (named CFENet) performs much better than the original SSD and the state-of-the-art method RefineDet especially for small objects, while keeping high efficiency close to the original SSD. Specifically, the single scale version of the proposed detector can run at the speed of 21 fps, while the multi-scale version with larger input size achieves the mAP 29.69, ranking second on the leaderboard
△ Less
Submitted 10 October, 2018; v1 submitted 26 June, 2018;
originally announced June 2018.
-
Experimental distillation of bi-partite polarization entanglement using polarizing Mach-Zehnder interferometers
Authors:
Chithrabhanu Perumangatt,
Tang Zong Sheng,
Alexander Ling
Abstract:
Entanglement distillation is the process of concentrating entanglement from a given quantum state. We present a technique for distillation of bi-partite polarization entanglement using interferometry. This technique can be optimized to extract maximal entanglement from any pure or mixed entangled state. A model for this method is presented and in particular we present experimental results for pure…
▽ More
Entanglement distillation is the process of concentrating entanglement from a given quantum state. We present a technique for distillation of bi-partite polarization entanglement using interferometry. This technique can be optimized to extract maximal entanglement from any pure or mixed entangled state. A model for this method is presented and in particular we present experimental results for pure states when using polarizing Mach-Zehnder interferometers. These experimentally distilled states always demonstrate an increased violation of Bell's inequality.
△ Less
Submitted 17 May, 2018;
originally announced May 2018.
-
A Quantization-Friendly Separable Convolution for MobileNets
Authors:
Tao Sheng,
Chen Feng,
Shaojie Zhuo,
Xiaopeng Zhang,
Liang Shen,
Mickey Aleksic
Abstract:
As deep learning (DL) is being rapidly pushed to edge computing, researchers invented various ways to make inference computation more efficient on mobile/IoT devices, such as network pruning, parameter compression, and etc. Quantization, as one of the key approaches, can effectively offload GPU, and make it possible to deploy DL on fixed-point pipeline. Unfortunately, not all existing networks des…
▽ More
As deep learning (DL) is being rapidly pushed to edge computing, researchers invented various ways to make inference computation more efficient on mobile/IoT devices, such as network pruning, parameter compression, and etc. Quantization, as one of the key approaches, can effectively offload GPU, and make it possible to deploy DL on fixed-point pipeline. Unfortunately, not all existing networks design are friendly to quantization. For example, the popular lightweight MobileNetV1, while it successfully reduces parameter size and computation latency with separable convolution, our experiment shows its quantized models have large accuracy gap against its float point models. To resolve this, we analyzed the root cause of quantization loss and proposed a quantization-friendly separable convolution architecture. By evaluating the image classification task on ImageNet2012 dataset, our modified MobileNetV1 model can archive 8-bit inference top-1 accuracy in 68.03%, almost closed the gap to the float pipeline.
△ Less
Submitted 12 March, 2019; v1 submitted 22 March, 2018;
originally announced March 2018.
-
Fault-tolerant and finite-error localization for point emitters within the diffraction limit
Authors:
Tang Zong Sheng,
Kadir Durak,
Alexander Ling
Abstract:
We implement an estimator for determining the separation between two incoherent point sources. This estimator relies on image inversion interferometry and when used with the appropriate data analytics, it yields an estimate of the separation with finite-error, even when the sources come arbitrarily close together. The experimental results show that the technique has a good tolerance to noise and m…
▽ More
We implement an estimator for determining the separation between two incoherent point sources. This estimator relies on image inversion interferometry and when used with the appropriate data analytics, it yields an estimate of the separation with finite-error, even when the sources come arbitrarily close together. The experimental results show that the technique has a good tolerance to noise and misalignment, making it an interesting consideration for high resolution instruments.
△ Less
Submitted 30 May, 2016; v1 submitted 24 May, 2016;
originally announced May 2016.