-
LCB-CV-UNet: Enhanced Detector for High Dynamic Range Radar Signals
Authors:
Yanbin Wang,
Xingyu Chen,
Yumiao Wang,
Xiang Wang,
Chuanfei Zang,
Guolong Cui,
Jiahuan Liu
Abstract:
We propose the LCB-CV-UNet to tackle performance degradation caused by High Dynamic Range (HDR) radar signals. Initially, a hardware-efficient, plug-and-play module named Logarithmic Connect Block (LCB) is proposed as a phase coherence preserving solution to address the inherent challenges in handling HDR features. Then, we propose the Dual Hybrid Dataset Construction method to generate a semi-syn…
▽ More
We propose the LCB-CV-UNet to tackle performance degradation caused by High Dynamic Range (HDR) radar signals. Initially, a hardware-efficient, plug-and-play module named Logarithmic Connect Block (LCB) is proposed as a phase coherence preserving solution to address the inherent challenges in handling HDR features. Then, we propose the Dual Hybrid Dataset Construction method to generate a semi-synthetic dataset, approximating typical HDR signal scenarios with adjustable target distributions. Simulation results show about 1% total detection probability improvement with under 0.9% computational complexity added compared with the baseline. Furthermore, it excels 5% over the baseline at the range in 11-13 dB signal-to-noise ratio typical for urban targets. Finally, the real experiment validates the practicality of our model.
△ Less
Submitted 29 May, 2025;
originally announced May 2025.
-
Federated Causal Inference in Healthcare: Methods, Challenges, and Applications
Authors:
Haoyang Li,
Jie Xu,
Kyra Gan,
Fei Wang,
Chengxi Zang
Abstract:
Federated causal inference enables multi-site treatment effect estimation without sharing individual-level data, offering a privacy-preserving solution for real-world evidence generation. However, data heterogeneity across sites, manifested in differences in covariate, treatment, and outcome, poses significant challenges for unbiased and efficient estimation. In this paper, we present a comprehens…
▽ More
Federated causal inference enables multi-site treatment effect estimation without sharing individual-level data, offering a privacy-preserving solution for real-world evidence generation. However, data heterogeneity across sites, manifested in differences in covariate, treatment, and outcome, poses significant challenges for unbiased and efficient estimation. In this paper, we present a comprehensive review and theoretical analysis of federated causal effect estimation across both binary/continuous and time-to-event outcomes. We classify existing methods into weight-based strategies and optimization-based frameworks and further discuss extensions including personalized models, peer-to-peer communication, and model decomposition. For time-to-event outcomes, we examine federated Cox and Aalen-Johansen models, deriving asymptotic bias and variance under heterogeneity. Our analysis reveals that FedProx-style regularization achieves near-optimal bias-variance trade-offs compared to naive averaging and meta-analysis. We review related software tools and conclude by outlining opportunities, challenges, and future directions for scalable, fair, and trustworthy federated causal inference in distributed healthcare systems.
△ Less
Submitted 4 May, 2025;
originally announced May 2025.
-
Dynamics of rotating helices in viscous fluid
Authors:
Chijing Zang,
Luke Omodt,
Moumita Dasgupta,
Xiang Cheng
Abstract:
We investigate the dynamics of a pair of rigid rotating helices in a viscous fluid, as a model for bacterial flagellar bundle and a prototype of microfluidic pumps. Combining experiments with hydrodynamic modeling, we examine how spacing and phase difference between the two helices affect their torque, flow field and fluid transport capacity at low Reynolds numbers. Hydrodynamic coupling reduces t…
▽ More
We investigate the dynamics of a pair of rigid rotating helices in a viscous fluid, as a model for bacterial flagellar bundle and a prototype of microfluidic pumps. Combining experiments with hydrodynamic modeling, we examine how spacing and phase difference between the two helices affect their torque, flow field and fluid transport capacity at low Reynolds numbers. Hydrodynamic coupling reduces the torque when the helices rotate in phase at constant angular speed, but increases the torque when they rotate out of phase. We identify a critical phase difference, at which the hydrodynamic coupling vanishes despite the close spacing between the helices. A simple model, based on the flow characteristics and positioning of a single helix, is constructed, which quantitatively predicts the torque of the helical pair in both unbounded and confined systems. Lastly, we show the influence of spacing and phase difference on the axial flux and the pump efficiency of the helices. Our findings shed light on the function of bacterial flagella and provide design principles for efficient low-Reynolds-number pumps.
△ Less
Submitted 9 May, 2025; v1 submitted 12 April, 2025;
originally announced April 2025.
-
AI-Powered Urban Transportation Digital Twin: Methods and Applications
Authors:
Xuan Di,
Yongjie Fu,
Mehmet K. Turkcan,
Mahshid Ghasemi,
Zhaobin Mo,
Chengbo Zang,
Abhishek Adhikari,
Zoran Kostic,
Gil Zussman
Abstract:
We present a survey paper on methods and applications of digital twins (DT) for urban traffic management. While the majority of studies on the DT focus on its "eyes," which is the emerging sensing and perception like object detection and tracking, what really distinguishes the DT from a traditional simulator lies in its ``brain," the prediction and decision making capabilities of extracting patter…
▽ More
We present a survey paper on methods and applications of digital twins (DT) for urban traffic management. While the majority of studies on the DT focus on its "eyes," which is the emerging sensing and perception like object detection and tracking, what really distinguishes the DT from a traditional simulator lies in its ``brain," the prediction and decision making capabilities of extracting patterns and making informed decisions from what has been seen and perceived. In order to add values to urban transportation management, DTs need to be powered by artificial intelligence and complement with low-latency high-bandwidth sensing and networking technologies. We will first review the DT pipeline leveraging cyberphysical systems and propose our DT architecture deployed on a real-world testbed in New York City. This survey paper can be a pointer to help researchers and practitioners identify challenges and opportunities for the development of DTs; a bridge to initiate conversations across disciplines; and a road map to exploiting potentials of DTs for diverse urban transportation applications.
△ Less
Submitted 29 December, 2024;
originally announced January 2025.
-
The Streetscape Application Services Stack (SASS): Towards a Distributed Sensing Architecture for Urban Applications
Authors:
Navid Salami Pargoo,
Mahshid Ghasemi,
Shuren Xia,
Mehmet Kerem Turkcan,
Taqiya Ehsan,
Chengbo Zang,
Yuan Sun,
Javad Ghaderi,
Gil Zussman,
Zoran Kostic,
Jorge Ortiz
Abstract:
As urban populations grow, cities are becoming more complex, driving the deployment of interconnected sensing systems to realize the vision of smart cities. These systems aim to improve safety, mobility, and quality of life through applications that integrate diverse sensors with real-time decision-making. Streetscape applications-focusing on challenges like pedestrian safety and adaptive traffic…
▽ More
As urban populations grow, cities are becoming more complex, driving the deployment of interconnected sensing systems to realize the vision of smart cities. These systems aim to improve safety, mobility, and quality of life through applications that integrate diverse sensors with real-time decision-making. Streetscape applications-focusing on challenges like pedestrian safety and adaptive traffic management-depend on managing distributed, heterogeneous sensor data, aligning information across time and space, and enabling real-time processing. These tasks are inherently complex and often difficult to scale. The Streetscape Application Services Stack (SASS) addresses these challenges with three core services: multimodal data synchronization, spatiotemporal data fusion, and distributed edge computing. By structuring these capabilities as clear, composable abstractions with clear semantics, SASS allows developers to scale streetscape applications efficiently while minimizing the complexity of multimodal integration.
We evaluated SASS in two real-world testbed environments: a controlled parking lot and an urban intersection in a major U.S. city. These testbeds allowed us to test SASS under diverse conditions, demonstrating its practical applicability. The Multimodal Data Synchronization service reduced temporal misalignment errors by 88%, achieving synchronization accuracy within 50 milliseconds. Spatiotemporal Data Fusion service improved detection accuracy for pedestrians and vehicles by over 10%, leveraging multicamera integration. The Distributed Edge Computing service increased system throughput by more than an order of magnitude. Together, these results show how SASS provides the abstractions and performance needed to support real-time, scalable urban applications, bridging the gap between sensing infrastructure and actionable streetscape intelligence.
△ Less
Submitted 12 January, 2025; v1 submitted 29 November, 2024;
originally announced November 2024.
-
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and Reliability
Authors:
Weitong Zhang,
Chengqi Zang,
Bernhard Kainz
Abstract:
Large Language Models (LLMs) often produce outputs that -- though plausible -- can lack consistency and reliability, particularly in ambiguous or complex scenarios. Challenges arise from ensuring that outputs align with both factual correctness and human intent. This is problematic in existing approaches that trade improved consistency for lower accuracy. To mitigate these challenges, we propose a…
▽ More
Large Language Models (LLMs) often produce outputs that -- though plausible -- can lack consistency and reliability, particularly in ambiguous or complex scenarios. Challenges arise from ensuring that outputs align with both factual correctness and human intent. This is problematic in existing approaches that trade improved consistency for lower accuracy. To mitigate these challenges, we propose a novel game-theoretic approach to enhance consistency and reliability during the decoding stage of LLM output generation. Our method models the decoding process as a multistage Bayesian decoding game. This ensures consistency through Correctness Alignment and enhances reliability via Ambiguity Calibration. The model dynamically converges to a consensus on the most reliable outputs and distinguishes {Valid, Specious} outputs without human feedback or additional training. Our game design allows smaller models to outperform much larger models through game mechanisms (e.g., 78.1 LLaMA13B vs 76.6 PaLM540B), as well as integrating various LL strategies and models, demonstrating the potential of game-theoretic tools to improve the truthfulness and reliability of LLMs.
△ Less
Submitted 1 October, 2024;
originally announced October 2024.
-
Boundless: Generating Photorealistic Synthetic Data for Object Detection in Urban Streetscapes
Authors:
Mehmet Kerem Turkcan,
Yuyang Li,
Chengbo Zang,
Javad Ghaderi,
Gil Zussman,
Zoran Kostic
Abstract:
We introduce Boundless, a photo-realistic synthetic data generation system for enabling highly accurate object detection in dense urban streetscapes. Boundless can replace massive real-world data collection and manual ground-truth object annotation (labeling) with an automated and configurable process. Boundless is based on the Unreal Engine 5 (UE5) City Sample project with improvements enabling a…
▽ More
We introduce Boundless, a photo-realistic synthetic data generation system for enabling highly accurate object detection in dense urban streetscapes. Boundless can replace massive real-world data collection and manual ground-truth object annotation (labeling) with an automated and configurable process. Boundless is based on the Unreal Engine 5 (UE5) City Sample project with improvements enabling accurate collection of 3D bounding boxes across different lighting and scene variability conditions.
We evaluate the performance of object detection models trained on the dataset generated by Boundless when used for inference on a real-world dataset acquired from medium-altitude cameras. We compare the performance of the Boundless-trained model against the CARLA-trained model and observe an improvement of 7.8 mAP. The results we achieved support the premise that synthetic data generation is a credible methodology for training/fine-tuning scalable object detection models for urban scenes.
△ Less
Submitted 26 September, 2024; v1 submitted 4 September, 2024;
originally announced September 2024.
-
Data-Driven Traffic Simulation for an Intersection in a Metropolis
Authors:
Chengbo Zang,
Mehmet Kerem Turkcan,
Gil Zussman,
Javad Ghaderi,
Zoran Kostic
Abstract:
We present a novel data-driven simulation environment for modeling traffic in metropolitan street intersections. Using real-world tracking data collected over an extended period of time, we train trajectory forecasting models to learn agent interactions and environmental constraints that are difficult to capture conventionally. Trajectories of new agents are first coarsely generated by sampling fr…
▽ More
We present a novel data-driven simulation environment for modeling traffic in metropolitan street intersections. Using real-world tracking data collected over an extended period of time, we train trajectory forecasting models to learn agent interactions and environmental constraints that are difficult to capture conventionally. Trajectories of new agents are first coarsely generated by sampling from the spatial and temporal generative distributions, then refined using state-of-the-art trajectory forecasting models. The simulation can run either autonomously, or under explicit human control conditioned on the generative distributions. We present the experiments for a variety of model configurations. Under an iterative prediction scheme, the way-point-supervised TrajNet++ model obtained 0.36 Final Displacement Error (FDE) in 20 FPS on an NVIDIA A100 GPU.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics
Authors:
Weitong Zhang,
Chengqi Zang,
Liu Li,
Sarah Cechnicka,
Cheng Ouyang,
Bernhard Kainz
Abstract:
Inverse problems describe the process of estimating the causal factors from a set of measurements or data. Mapping of often incomplete or degraded data to parameters is ill-posed, thus data-driven iterative solutions are required, for example when reconstructing clean images from poor signals. Diffusion models have shown promise as potent generative tools for solving inverse problems due to their…
▽ More
Inverse problems describe the process of estimating the causal factors from a set of measurements or data. Mapping of often incomplete or degraded data to parameters is ill-posed, thus data-driven iterative solutions are required, for example when reconstructing clean images from poor signals. Diffusion models have shown promise as potent generative tools for solving inverse problems due to their superior reconstruction quality and their compatibility with iterative solvers. However, most existing approaches are limited to linear inverse problems represented as Stochastic Differential Equations (SDEs). This simplification falls short of addressing the challenging nature of real-world problems, leading to amplified cumulative errors and biases. We provide an explanation for this gap through the lens of measure-preserving dynamics of Random Dynamical Systems (RDS) with which we analyse Temporal Distribution Discrepancy and thus introduce a theoretical framework based on RDS for SDE diffusion models. We uncover several strategies that inherently enhance the stability and generalizability of diffusion models for inverse problems and introduce a novel score-based diffusion framework, the \textbf{D}ynamics-aware S\textbf{D}E \textbf{D}iffusion \textbf{G}enerative \textbf{M}odel (D$^3$GM). The \textit{Measure-preserving property} can return the degraded measurement to the original state despite complex degradation with the RDS concept of \textit{stability}. Our extensive experimental results corroborate the effectiveness of D$^3$GM across multiple benchmarks including a prominent application for inverse problems, magnetic resonance imaging. Code and data will be publicly available.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection
Authors:
Mehmet Kerem Turkcan,
Sanjeev Narasimhan,
Chengbo Zang,
Gyung Hyun Je,
Bo Yu,
Mahshid Ghasemi,
Javad Ghaderi,
Gil Zussman,
Zoran Kostic
Abstract:
We introduce Constellation, a dataset of 13K images suitable for research on detection of objects in dense urban streetscapes observed from high-elevation cameras, collected for a variety of temporal conditions. The dataset addresses the need for curated data to explore problems in small object detection exemplified by the limited pixel footprint of pedestrians observed tens of meters from above.…
▽ More
We introduce Constellation, a dataset of 13K images suitable for research on detection of objects in dense urban streetscapes observed from high-elevation cameras, collected for a variety of temporal conditions. The dataset addresses the need for curated data to explore problems in small object detection exemplified by the limited pixel footprint of pedestrians observed tens of meters from above. It enables the testing of object detection models for variations in lighting, building shadows, weather, and scene dynamics. We evaluate contemporary object detection architectures on the dataset, observing that state-of-the-art methods have lower performance in detecting small pedestrians compared to vehicles, corresponding to a 10% difference in average precision (AP). Using structurally similar datasets for pretraining the models results in an increase of 1.8% mean AP (mAP). We further find that incorporating domain-specific data augmentations helps improve model performance. Using pseudo-labeled data, obtained from inference outcomes of the best-performing models, improves the performance of the models. Finally, comparing the models trained using the data collected in two different time intervals, we find a performance drift in models due to the changes in intersection conditions over time. The best-performing model achieves a pedestrian AP of 92.0% with 11.5 ms inference time on NVIDIA A100 GPUs, and an mAP of 95.4%.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
Detection of circular permutations by Protein Language Models
Authors:
Yue Hu,
Bin Huang,
Chunzi Zang
Abstract:
Protein circular permutations are crucial for understanding protein evolution and functionality. Traditional detection methods, sequence-based or structure-based, struggle with accuracy and computational efficiency, the latter also limited by treating proteins as rigid bodies. The plmCP method, utilizing a protein language model, not only speeds up the detection process but also enhances the accur…
▽ More
Protein circular permutations are crucial for understanding protein evolution and functionality. Traditional detection methods, sequence-based or structure-based, struggle with accuracy and computational efficiency, the latter also limited by treating proteins as rigid bodies. The plmCP method, utilizing a protein language model, not only speeds up the detection process but also enhances the accuracy of identifying circular permutations, contributing significantly to protein research and engineering by acknowledging structural flexibility.
△ Less
Submitted 6 August, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
-
QNCD: Quantization Noise Correction for Diffusion Models
Authors:
Huanpeng Chu,
Wei Wu,
Chengjie Zang,
Kun Yuan
Abstract:
Diffusion models have revolutionized image synthesis, setting new benchmarks in quality and creativity. However, their widespread adoption is hindered by the intensive computation required during the iterative denoising process. Post-training quantization (PTQ) presents a solution to accelerate sampling, aibeit at the expense of sample quality, extremely in low-bit settings. Addressing this, our s…
▽ More
Diffusion models have revolutionized image synthesis, setting new benchmarks in quality and creativity. However, their widespread adoption is hindered by the intensive computation required during the iterative denoising process. Post-training quantization (PTQ) presents a solution to accelerate sampling, aibeit at the expense of sample quality, extremely in low-bit settings. Addressing this, our study introduces a unified Quantization Noise Correction Scheme (QNCD), aimed at minishing quantization noise throughout the sampling process. We identify two primary quantization challenges: intra and inter quantization noise. Intra quantization noise, mainly exacerbated by embeddings in the resblock module, extends activation quantization ranges, increasing disturbances in each single denosing step. Besides, inter quantization noise stems from cumulative quantization deviations across the entire denoising process, altering data distributions step-by-step. QNCD combats these through embedding-derived feature smoothing for eliminating intra quantization noise and an effective runtime noise estimatiation module for dynamicly filtering inter quantization noise. Extensive experiments demonstrate that our method outperforms previous quantization methods for diffusion models, achieving lossless results in W4A8 and W8A8 quantization settings on ImageNet (LDM-4). Code is available at: https://github.com/huanpengchu/QNCD
△ Less
Submitted 18 September, 2024; v1 submitted 28 March, 2024;
originally announced March 2024.
-
Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal Storyteller
Authors:
Chuanqi Zang,
Jiji Tang,
Rongsheng Zhang,
Zeng Zhao,
Tangjie Lv,
Mingtao Pei,
Wei Liang
Abstract:
Storytelling aims to generate reasonable and vivid narratives based on an ordered image stream. The fidelity to the image story theme and the divergence of story plots attract readers to keep reading. Previous works iteratively improved the alignment of multiple modalities but ultimately resulted in the generation of simplistic storylines for image streams. In this work, we propose a new pipeline,…
▽ More
Storytelling aims to generate reasonable and vivid narratives based on an ordered image stream. The fidelity to the image story theme and the divergence of story plots attract readers to keep reading. Previous works iteratively improved the alignment of multiple modalities but ultimately resulted in the generation of simplistic storylines for image streams. In this work, we propose a new pipeline, termed LLaMS, to generate multimodal human-level stories that are embodied in expressiveness and consistency. Specifically, by fully exploiting the commonsense knowledge within the LLM, we first employ a sequence data auto-enhancement strategy to enhance factual content expression and leverage a textual reasoning architecture for expressive story generation and prediction. Secondly, we propose SQ-Adatpter module for story illustration generation which can maintain sequence consistency. Numerical results are conducted through human evaluation to verify the superiority of proposed LLaMS. Evaluations show that LLaMS achieves state-of-the-art storytelling performance and 86% correlation and 100% consistency win rate as compared with previous SOTA methods. Furthermore, ablation experiments are conducted to verify the effectiveness of proposed sequence data enhancement and SQ-Adapter.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Emerging Opportunities of Using Large Language Models for Translation Between Drug Molecules and Indications
Authors:
David Oniani,
Jordan Hilsman,
Chengxi Zang,
Junmei Wang,
Lianjin Cai,
Jan Zawala,
Yanshan Wang
Abstract:
A drug molecule is a substance that changes the organism's mental or physical state. Every approved drug has an indication, which refers to the therapeutic use of that drug for treating a particular medical condition. While the Large Language Model (LLM), a generative Artificial Intelligence (AI) technique, has recently demonstrated effectiveness in translating between molecules and their textual…
▽ More
A drug molecule is a substance that changes the organism's mental or physical state. Every approved drug has an indication, which refers to the therapeutic use of that drug for treating a particular medical condition. While the Large Language Model (LLM), a generative Artificial Intelligence (AI) technique, has recently demonstrated effectiveness in translating between molecules and their textual descriptions, there remains a gap in research regarding their application in facilitating the translation between drug molecules and indications, or vice versa, which could greatly benefit the drug discovery process. The capability of generating a drug from a given indication would allow for the discovery of drugs targeting specific diseases or targets and ultimately provide patients with better treatments. In this paper, we first propose a new task, which is the translation between drug molecules and corresponding indications, and then test existing LLMs on this new task. Specifically, we consider nine variations of the T5 LLM and evaluate them on two public datasets obtained from ChEMBL and DrugBank. Our experiments show the early results of using LLMs for this task and provide a perspective on the state-of-the-art. We also emphasize the current limitations and discuss future work that has the potential to improve the performance on this task. The creation of molecules from indications, or vice versa, will allow for more efficient targeting of diseases and significantly reduce the cost of drug discovery, with the potential to revolutionize the field of drug discovery in the era of generative AI.
△ Less
Submitted 16 February, 2024; v1 submitted 14 February, 2024;
originally announced February 2024.
-
Assessing univariate and bivariate risks of late-frost and drought using vine copulas: A historical study for Bavaria
Authors:
Marija Tepegjozova,
Benjamin F. Meyer,
Anja Rammig,
Christian S. Zang,
Claudia Czado
Abstract:
In light of climate change's impacts on forests, including extreme drought and late-frost, leading to vitality decline and regional forest die-back, we assess univariate drought and late-frost risks and perform a joint risk analysis in Bavaria, Germany, from 1952 to 2020. Utilizing a vast dataset with 26 bioclimatic and topographic variables, we employ vine copula models due to the data's non-Gaus…
▽ More
In light of climate change's impacts on forests, including extreme drought and late-frost, leading to vitality decline and regional forest die-back, we assess univariate drought and late-frost risks and perform a joint risk analysis in Bavaria, Germany, from 1952 to 2020. Utilizing a vast dataset with 26 bioclimatic and topographic variables, we employ vine copula models due to the data's non-Gaussian and asymmetric dependencies. We use D-vine regression for univariate and Y-vine regression for bivariate analysis, and propose corresponding univariate and bivariate conditional probability risk measures. We identify "at-risk" regions, emphasizing the need for forest adaptation due to climate change.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
A Novel Dual-pooling Attention Module for UAV Vehicle Re-identification
Authors:
Xiaoyan Guo,
Jie Yang,
Xinyu Jia,
Chuanyan Zang,
Yan Xu,
Zhaoyang Chen
Abstract:
Vehicle re-identification (Re-ID) involves identifying the same vehicle captured by other cameras, given a vehicle image. It plays a crucial role in the development of safe cities and smart cities. With the rapid growth and implementation of unmanned aerial vehicles (UAVs) technology, vehicle Re-ID in UAV aerial photography scenes has garnered significant attention from researchers. However, due t…
▽ More
Vehicle re-identification (Re-ID) involves identifying the same vehicle captured by other cameras, given a vehicle image. It plays a crucial role in the development of safe cities and smart cities. With the rapid growth and implementation of unmanned aerial vehicles (UAVs) technology, vehicle Re-ID in UAV aerial photography scenes has garnered significant attention from researchers. However, due to the high altitude of UAVs, the shooting angle of vehicle images sometimes approximates vertical, resulting in fewer local features for Re-ID. Therefore, this paper proposes a novel dual-pooling attention (DpA) module, which achieves the extraction and enhancement of locally important information about vehicles from both channel and spatial dimensions by constructing two branches of channel-pooling attention (CpA) and spatial-pooling attention (SpA), and employing multiple pooling operations to enhance the attention to fine-grained information of vehicles. Specifically, the CpA module operates between the channels of the feature map and splices features by combining four pooling operations so that vehicle regions containing discriminative information are given greater attention. The SpA module uses the same pooling operations strategy to identify discriminative representations and merge vehicle features in image regions in a weighted manner. The feature information of both dimensions is finally fused and trained jointly using label smoothing cross-entropy loss and hard mining triplet loss, thus solving the problem of missing detail information due to the high height of UAV shots. The proposed method's effectiveness is demonstrated through extensive experiments on the UAV-based vehicle datasets VeRi-UAV and VRU.
△ Less
Submitted 24 June, 2023;
originally announced June 2023.
-
Reinforcement Learning for Solving Robotic Reaching Tasks in the Neurorobotics Platform
Authors:
Marton Szep,
Leander Lauenburg,
Kevin Farkas,
Xiyan Su,
Chuanlong Zang
Abstract:
In recent years, reinforcement learning (RL) has shown great potential for solving tasks in well-defined environments like games or robotics. This paper aims to solve the robotic reaching task in a simulation run on the Neurorobotics Platform (NRP). The target position is initialized randomly and the robot has 6 degrees of freedom. We compare the performance of various state-of-the-art model-free…
▽ More
In recent years, reinforcement learning (RL) has shown great potential for solving tasks in well-defined environments like games or robotics. This paper aims to solve the robotic reaching task in a simulation run on the Neurorobotics Platform (NRP). The target position is initialized randomly and the robot has 6 degrees of freedom. We compare the performance of various state-of-the-art model-free algorithms. At first, the agent is trained on ground truth data from the simulation to reach the target position in only one continuous movement. Later the complexity of the task is increased by using image data as input from the simulation environment. Experimental results show that training efficiency and results can be improved with appropriate dynamic training schedule function for curriculum learning.
△ Less
Submitted 31 October, 2022;
originally announced October 2022.
-
SCEHR: Supervised Contrastive Learning for Clinical Risk Prediction using Electronic Health Records
Authors:
Chengxi Zang,
Fei Wang
Abstract:
Contrastive learning has demonstrated promising performance in image and text domains either in a self-supervised or a supervised manner. In this work, we extend the supervised contrastive learning framework to clinical risk prediction problems based on longitudinal electronic health records (EHR). We propose a general supervised contrastive loss…
▽ More
Contrastive learning has demonstrated promising performance in image and text domains either in a self-supervised or a supervised manner. In this work, we extend the supervised contrastive learning framework to clinical risk prediction problems based on longitudinal electronic health records (EHR). We propose a general supervised contrastive loss $\mathcal{L}_{\text{Contrastive Cross Entropy} } + λ\mathcal{L}_{\text{Supervised Contrastive Regularizer}}$ for learning both binary classification (e.g. in-hospital mortality prediction) and multi-label classification (e.g. phenotyping) in a unified framework. Our supervised contrastive loss practices the key idea of contrastive learning, namely, pulling similar samples closer and pushing dissimilar ones apart from each other, simultaneously by its two components: $\mathcal{L}_{\text{Contrastive Cross Entropy} }$ tries to contrast samples with learned anchors which represent positive and negative clusters, and $\mathcal{L}_{\text{Supervised Contrastive Regularizer}}$ tries to contrast samples with each other according to their supervised labels. We propose two versions of the above supervised contrastive loss and our experiments on real-world EHR data demonstrate that our proposed loss functions show benefits in improving the performance of strong baselines and even state-of-the-art models on benchmarking tasks for clinical risk predictions. Our loss functions work well with extremely imbalanced data which are common for clinical risk prediction problems. Our loss functions can be easily used to replace (binary or multi-label) cross-entropy loss adopted in existing clinical predictive models. The Pytorch code is released at \url{https://github.com/calvin-zcx/SCEHR}.
△ Less
Submitted 10 October, 2021;
originally announced October 2021.
-
Contrastive Learning Improves Critical Event Prediction in COVID-19 Patients
Authors:
Tingyi Wanyan,
Hossein Honarvar,
Suraj K. Jaladanki,
Chengxi Zang,
Nidhi Naik,
Sulaiman Somani,
Jessica K. De Freitas,
Ishan Paranjpe,
Akhil Vaid,
Riccardo Miotto,
Girish N. Nadkarni,
Marinka Zitnik,
ArifulAzad,
Fei Wang,
Ying Ding,
Benjamin S. Glicksberg
Abstract:
Machine Learning (ML) models typically require large-scale, balanced training data to be robust, generalizable, and effective in the context of healthcare. This has been a major issue for developing ML models for the coronavirus-disease 2019 (COVID-19) pandemic where data is highly imbalanced, particularly within electronic health records (EHR) research. Conventional approaches in ML use cross-ent…
▽ More
Machine Learning (ML) models typically require large-scale, balanced training data to be robust, generalizable, and effective in the context of healthcare. This has been a major issue for developing ML models for the coronavirus-disease 2019 (COVID-19) pandemic where data is highly imbalanced, particularly within electronic health records (EHR) research. Conventional approaches in ML use cross-entropy loss (CEL) that often suffers from poor margin classification. For the first time, we show that contrastive loss (CL) improves the performance of CEL especially for imbalanced EHR data and the related COVID-19 analyses. This study has been approved by the Institutional Review Board at the Icahn School of Medicine at Mount Sinai. We use EHR data from five hospitals within the Mount Sinai Health System (MSHS) to predict mortality, intubation, and intensive care unit (ICU) transfer in hospitalized COVID-19 patients over 24 and 48 hour time windows. We train two sequential architectures (RNN and RETAIN) using two loss functions (CEL and CL). Models are tested on full sample data set which contain all available data and restricted data set to emulate higher class imbalance.CL models consistently outperform CEL models with the restricted data set on these tasks with differences ranging from 0.04 to 0.15 for AUPRC and 0.05 to 0.1 for AUROC. For the restricted sample, only the CL model maintains proper clustering and is able to identify important features, such as pulse oximetry. CL outperforms CEL in instances of severe class imbalance, on three EHR outcomes with respect to three performance metrics: predictive power, clustering, and feature importance. We believe that the developed CL framework can be expanded and used for EHR ML work in general.
△ Less
Submitted 11 January, 2021;
originally announced January 2021.
-
Visualizing Deep Graph Generative Models for Drug Discovery
Authors:
Karan Yang,
Chengxi Zang,
Fei Wang
Abstract:
Drug discovery aims at designing novel molecules with specific desired properties for clinical trials. Over past decades, drug discovery and development have been a costly and time consuming process. Driven by big chemical data and AI, deep generative models show great potential to accelerate the drug discovery process. Existing works investigate different deep generative frameworks for molecular…
▽ More
Drug discovery aims at designing novel molecules with specific desired properties for clinical trials. Over past decades, drug discovery and development have been a costly and time consuming process. Driven by big chemical data and AI, deep generative models show great potential to accelerate the drug discovery process. Existing works investigate different deep generative frameworks for molecular generation, however, less attention has been paid to the visualization tools to quickly demo and evaluate model's results. Here, we propose a visualization framework which provides interactive visualization tools to visualize molecules generated during the encoding and decoding process of deep graph generative models, and provide real time molecular optimization functionalities. Our work tries to empower black box AI driven drug discovery models with some visual interpretabilities.
△ Less
Submitted 20 July, 2020;
originally announced July 2020.
-
MoFlow: An Invertible Flow Model for Generating Molecular Graphs
Authors:
Chengxi Zang,
Fei Wang
Abstract:
Generating molecular graphs with desired chemical properties driven by deep graph generative models provides a very promising way to accelerate drug discovery process. Such graph generative models usually consist of two steps: learning latent representations and generation of molecular graphs. However, to generate novel and chemically-valid molecular graphs from latent representations is very chal…
▽ More
Generating molecular graphs with desired chemical properties driven by deep graph generative models provides a very promising way to accelerate drug discovery process. Such graph generative models usually consist of two steps: learning latent representations and generation of molecular graphs. However, to generate novel and chemically-valid molecular graphs from latent representations is very challenging because of the chemical constraints and combinatorial complexity of molecular graphs. In this paper, we propose MoFlow, a flow-based graph generative model to learn invertible mappings between molecular graphs and their latent representations. To generate molecular graphs, our MoFlow first generates bonds (edges) through a Glow based model, then generates atoms (nodes) given bonds by a novel graph conditional flow, and finally assembles them into a chemically valid molecular graph with a posthoc validity correction. Our MoFlow has merits including exact and tractable likelihood training, efficient one-pass embedding and generation, chemical validity guarantees, 100\% reconstruction of training data, and good generalization ability. We validate our model by four tasks: molecular graph generation and reconstruction, visualization of the continuous latent space, property optimization, and constrained property optimization. Our MoFlow achieves state-of-the-art performance, which implies its potential efficiency and effectiveness to explore large chemical space for drug discovery.
△ Less
Submitted 17 June, 2020;
originally announced June 2020.
-
Neural Dynamics on Complex Networks
Authors:
Chengxi Zang,
Fei Wang
Abstract:
Learning continuous-time dynamics on complex networks is crucial for understanding, predicting and controlling complex systems in science and engineering. However, this task is very challenging due to the combinatorial complexities in the structures of high dimensional systems, their elusive continuous-time nonlinear dynamics, and their structural-dynamic dependencies. To address these challenges,…
▽ More
Learning continuous-time dynamics on complex networks is crucial for understanding, predicting and controlling complex systems in science and engineering. However, this task is very challenging due to the combinatorial complexities in the structures of high dimensional systems, their elusive continuous-time nonlinear dynamics, and their structural-dynamic dependencies. To address these challenges, we propose to combine Ordinary Differential Equation Systems (ODEs) and Graph Neural Networks (GNNs) to learn continuous-time dynamics on complex networks in a data-driven manner. We model differential equation systems by GNNs. Instead of mapping through a discrete number of neural layers in the forward process, we integrate GNN layers over continuous time numerically, leading to capturing continuous-time dynamics on graphs. Our model can be interpreted as a Continuous-time GNN model or a Graph Neural ODEs model. Our model can be utilized for continuous-time network dynamics prediction, structured sequence prediction (a regularly-sampled case), and node semi-supervised classification tasks (a one-snapshot case) in a unified framework. We validate our model by extensive experiments in the above three scenarios. The promising experimental results demonstrate our model's capability of jointly capturing the structure and dynamics of complex systems in a unified framework.
△ Less
Submitted 17 June, 2020; v1 submitted 18 August, 2019;
originally announced August 2019.
-
Quantifying impacts of the drought 2018 on European ecosystems in comparison to 2003
Authors:
Allan Buras,
Anja Rammig,
Christian S. Zang
Abstract:
In recent decades, an increasing persistence of atmospheric circulation patterns has been observed. In the course of the associated long-lasting anticyclonic summer circulations, heat waves and drought spells often coincide, leading to so-called hotter droughts. Previous hotter droughts caused a decrease in agricultural yields and increase in tree mortality, and thus, had a remarkable effect on ca…
▽ More
In recent decades, an increasing persistence of atmospheric circulation patterns has been observed. In the course of the associated long-lasting anticyclonic summer circulations, heat waves and drought spells often coincide, leading to so-called hotter droughts. Previous hotter droughts caused a decrease in agricultural yields and increase in tree mortality, and thus, had a remarkable effect on carbon budgets and negative economic impacts. Consequently, a quantification of ecosystem responses to hotter droughts and a better understanding of the underlying mechanisms is crucial. In this context, the European hotter drought of the year 2018 may be considered as a key event. As a first step towards the quantification of its causes and consequences, we here assess anomalies of atmospheric circulation patterns, temperature loads, and climatic water balance as potential drivers of ecosystem responses as quantified by remote sensing using the MODIS vegetation indices NDVI and EVI. To place the drought of 2018 within a climatological context, we compare its climatic features and ecosystem response with the extreme hot drought of 2003. Our results indicated 2018 to be characterized by a climatic dipole, featuring extremely hot and dry weather conditions north of the Alps but comparably cool and moist conditions across large parts of the Mediterranean. Analyzing ecosystem response of five dominant land-cover classes, we found significant positive effects of April-July climatic water balance on ecosystem productivity. Negative drought impacts appeared to affect a larger area in 2018 compared to 2003. We found a significantly higher sensitivity of pastures and arable land to climatic water balance compared to forests in both years. This study quantifies the drought of 2018 as a yet unprecedented event and provides valuable insights into the heterogeneous drought responses of European ecosystems.
△ Less
Submitted 15 July, 2019; v1 submitted 7 June, 2019;
originally announced June 2019.
-
Deep Learning in Multiple Multistep Time Series Prediction
Authors:
Chuanyun Zang
Abstract:
The project aims to research on combining deep learning specifically Long-Short Memory (LSTM) and basic statistics in multiple multistep time series prediction. LSTM can dive into all the pages and learn the general trends of variation in a large scope, while the well selected medians for each page can keep the special seasonality of different pages so that the future trend will not fluctuate too…
▽ More
The project aims to research on combining deep learning specifically Long-Short Memory (LSTM) and basic statistics in multiple multistep time series prediction. LSTM can dive into all the pages and learn the general trends of variation in a large scope, while the well selected medians for each page can keep the special seasonality of different pages so that the future trend will not fluctuate too much from the reality. A recent Kaggle competition on 145K Web Traffic Time Series Forecasting [1] is used to thoroughly illustrate and test this idea.
△ Less
Submitted 12 October, 2017;
originally announced October 2017.
-
OGLE-2016-BLG-0613LABb: A Microlensing Planet in a Binary System
Authors:
C. Han,
A. Udalski,
A. Gould,
C. -U. Lee,
Y. Shvartzvald,
W. C. Zang,
S. Mao,
S. Kozłowski,
M. D. Albrow,
S. -J. Chung,
K. -H. Hwang,
Y. K. Jung,
D. Kim,
H. -W. Kim,
Y. -H. Ryu,
I. -G. Shin,
J. C. Yee,
W. Zhu,
S. -M. Cha,
S. -L. Kim,
D. -J. Kim,
Y. Lee,
B. -G. Park,
J. Skowron,
P. Mróz
, et al. (15 additional authors not shown)
Abstract:
We present the analysis of OGLE-2016-BLG-0613, for which the lensing light curve appears to be that of a typical binary-lens event with two caustic spikes but with a discontinuous feature on the trough between the spikes. We find that the discontinuous feature was produced by a planetary companion to the binary lens. We find 4 degenerate triple-lens solution classes, each composed of a pair of sol…
▽ More
We present the analysis of OGLE-2016-BLG-0613, for which the lensing light curve appears to be that of a typical binary-lens event with two caustic spikes but with a discontinuous feature on the trough between the spikes. We find that the discontinuous feature was produced by a planetary companion to the binary lens. We find 4 degenerate triple-lens solution classes, each composed of a pair of solutions according to the well-known wide/close planetary degeneracy. One of these solution classes is excluded due to its relatively poor fit. For the remaining three pairs of solutions, the most-likely primary mass is about $M_1\sim 0.7\,M_\odot$ while the planet is a super-Jupiter. In all cases the system lies in the Galactic disk, about half-way toward the Galactic bulge. However, in one of these three solution classes, the secondary of the binary system is a low-mass brown dwarf, with relative mass ratios (1 : 0.03 : 0.003), while in the two others the masses of the binary components are comparable. These two possibilities can be distinguished in about 2024 when the measured lens-source relative proper motion will permit separate resolution of the lens and source.
△ Less
Submitted 2 October, 2017;
originally announced October 2017.
-
Structural patterns of information cascades and their implications for dynamics and semantics
Authors:
Chengxi Zang,
Peng Cui,
Chaoming Song,
Christos Faloutsos,
Wenwu Zhu
Abstract:
Information cascades are ubiquitous in both physical society and online social media, taking on large variations in structures, dynamics and semantics. Although the dynamics and semantics of information cascades have been studied, the structural patterns and their correlations with dynamics and semantics are largely unknown. Here we explore a large-scale dataset including $432$ million information…
▽ More
Information cascades are ubiquitous in both physical society and online social media, taking on large variations in structures, dynamics and semantics. Although the dynamics and semantics of information cascades have been studied, the structural patterns and their correlations with dynamics and semantics are largely unknown. Here we explore a large-scale dataset including $432$ million information cascades with explicit records of spreading traces, spreading behaviors, information content as well as user profiles. We find that the structural complexity of information cascades is far beyond the previous conjectures. We first propose a ten-dimensional metric to quantify the structural characteristics of information cascades, reflecting cascade size, silhouette, direction and activity aspects. We find that bimodal law governs majority of the metrics, information flows in cascades have four directions, and the self-loop number and average activity of cascades follows power law. We then analyze the high-order structural patterns of information cascades. Finally, we evaluate to what extent the structural features of information cascades can explain its dynamic patterns and semantics, and finally uncover some notable implications of structural patterns in information cascades. Our discoveries also provide a foundation for the microscopic mechanisms for information spreading, potentially leading to implications for cascade prediction and outlier detection.
△ Less
Submitted 8 August, 2017;
originally announced August 2017.
-
Matchings in $k$-partite $k$-uniform Hypergraphs
Authors:
Jie Han,
Chuanyun Zang,
Yi Zhao
Abstract:
For $k\ge 3$ and $ε>0$, let $H$ be a $k$-partite $k$-graph with parts $V_1,\dots, V_k$ each of size $n$, where $n$ is sufficiently large. Assume that for each $i\in [k]$, every $(k-1)$-set in $\prod_{j\in [k]\setminus \{i\}} V_i$ lies in at least $a_i$ edges, and $a_1\ge a_2\ge \cdots \ge a_k$. We show that if $a_1, a_2\ge εn$, then $H$ contains a matching of size…
▽ More
For $k\ge 3$ and $ε>0$, let $H$ be a $k$-partite $k$-graph with parts $V_1,\dots, V_k$ each of size $n$, where $n$ is sufficiently large. Assume that for each $i\in [k]$, every $(k-1)$-set in $\prod_{j\in [k]\setminus \{i\}} V_i$ lies in at least $a_i$ edges, and $a_1\ge a_2\ge \cdots \ge a_k$. We show that if $a_1, a_2\ge εn$, then $H$ contains a matching of size $\min\{n-1, \sum_{i\in [k]}a_i\}$. In particular, $H$ contains a matching of size $n-1$ if each crossing $(k-1)$-set lies in at least $\lceil n/k \rceil$ edges, or each crossing $(k-1)$-set lies in at least $\lfloor n/k \rfloor$ edges and $n\equiv 1\bmod k$. This special case answers a question of Rödl and Ruciński and was independently obtained by Lu, Wang, and Yu.
The proof of Lu, Wang, and Yu closely follows the approach of Han [Combin. Probab. Comput. 24 (2015), 723--732] by using the absorbing method and considering an extremal case. In contrast, our result is more general and its proof is thus more involved: it uses a more complex absorbing method and deals with two extremal cases.
△ Less
Submitted 17 February, 2018; v1 submitted 1 November, 2016;
originally announced November 2016.
-
The Strong Chromatic Index of graphs with maximum degree $Δ$
Authors:
Chuanyun Zang
Abstract:
A strong edge-coloring of a graph $G$ is an edge-coloring such that no two edges of distance at most two receive the same color. The strong chromatic index $χ'_s(G)$ is the minimum number of colors in a strong edge-coloring of $G$. P. Erdős and J. Nešetřil conjectured in 1985 that $χ'_s(G)$ is bounded above by $\frac54Δ^2$ when $Δ$ is even and $\frac14(5Δ^2-2Δ+1)$ when $Δ$ is odd, where $Δ$ is the…
▽ More
A strong edge-coloring of a graph $G$ is an edge-coloring such that no two edges of distance at most two receive the same color. The strong chromatic index $χ'_s(G)$ is the minimum number of colors in a strong edge-coloring of $G$. P. Erdős and J. Nešetřil conjectured in 1985 that $χ'_s(G)$ is bounded above by $\frac54Δ^2$ when $Δ$ is even and $\frac14(5Δ^2-2Δ+1)$ when $Δ$ is odd, where $Δ$ is the maximum degree of $G$. In this paper, we give an algorithm that uses at most $2Δ^2-3Δ+2$ colors for graphs with girth at least $5$. And in particular, we prove that any graph with maximum degree $Δ=5$ has a strong edge-coloring with $37$ colors.
△ Less
Submitted 3 October, 2015;
originally announced October 2015.
-
Topological Hierarchy Insulators and Topological Fractal Insulators
Authors:
Jing He,
Chun-Li Zang,
Ying Liang,
Su-Peng Kou
Abstract:
Topological insulators are new states of quantum matter with metallic edge/surface states. In this paper, we pointed out that there exists a new type of particle-hole symmetry-protected topological insulator - topological hierarchy insulator (THI), a composite topological state of a (parent) topological insulator and its defect-induced topological mid-gap states. A particular type of THI is topolo…
▽ More
Topological insulators are new states of quantum matter with metallic edge/surface states. In this paper, we pointed out that there exists a new type of particle-hole symmetry-protected topological insulator - topological hierarchy insulator (THI), a composite topological state of a (parent) topological insulator and its defect-induced topological mid-gap states. A particular type of THI is topological fractal insulator, that is a THI with self-similar topological structure. In the end, we discuss the possible experimental realizations of THIs.
△ Less
Submitted 5 June, 2015;
originally announced September 2015.
-
Effects of Attractive correlation on Topological Flat-bands Model
Authors:
Chun-Li Zang,
Jing He,
Ya-Jie Wu
Abstract:
In this paper, we study the effects of attractive correlation on the topological insulator ($TI$) with topological flat-bands using an extended attractive Kane-Mele-Hubbard model (KMHM). In the KMHM, we found a quantum phase transition from $TI$ to the superconductor ($SC$) state upon the increasing of the attractive Hubbard interaction $U$ at the mean field level. This type of $SC$ phase transiti…
▽ More
In this paper, we study the effects of attractive correlation on the topological insulator ($TI$) with topological flat-bands using an extended attractive Kane-Mele-Hubbard model (KMHM). In the KMHM, we found a quantum phase transition from $TI$ to the superconductor ($SC$) state upon the increasing of the attractive Hubbard interaction $U$ at the mean field level. This type of $SC$ phase transition is different from the traditional $SC$ phase transition which develops from the gapless Fermi Liquid. Cooperon-type gapped excitations exist in the $TI$ side near this type of $SC$ phase transition.
△ Less
Submitted 9 April, 2015;
originally announced April 2015.
-
Minimum vertex degree thresholds for tiling complete 3-partite 3-graphs
Authors:
Jie Han,
Chuanyun Zang,
Yi Zhao
Abstract:
Given positive integers $a\leq b \leq c$, let $K_{a,b,c}$ be the complete 3-partite 3-uniform hypergraph with three parts of sizes $a,b,c$. Let $H$ be a 3-uniform hypergraph on $n$ vertices where $n$ is divisible by $a+b+c$. We asymptotically determine the minimum vertex degree of $H$ that guarantees a perfect $K_{a, b, c}$-tiling, that is, a spanning subgraph of $H$ consisting of vertex-disjoint…
▽ More
Given positive integers $a\leq b \leq c$, let $K_{a,b,c}$ be the complete 3-partite 3-uniform hypergraph with three parts of sizes $a,b,c$. Let $H$ be a 3-uniform hypergraph on $n$ vertices where $n$ is divisible by $a+b+c$. We asymptotically determine the minimum vertex degree of $H$ that guarantees a perfect $K_{a, b, c}$-tiling, that is, a spanning subgraph of $H$ consisting of vertex-disjoint copies of $K_{a, b, c}$. This partially answers a question of Mycroft, who proved an analogous result with respect to codegree for $r$-uniform hypergraphs for all $r\ge 3$. Our proof uses a lattice-based absorbing method, the concept of fractional tiling, and a recent result on shadows for 3-graphs.
△ Less
Submitted 13 August, 2017; v1 submitted 30 March, 2015;
originally announced March 2015.
-
Magnetic Orders of Correlated Topological Insulators at Finite Temperature
Authors:
Ying-Xue Zhu,
Jing He,
Chun-Li Zang,
Ying Liang,
Su-Peng Kou
Abstract:
In this paper, we study the magnetic orders of two dimensional correlated topological insulators including the correlated Chern insulator and the correlated Z2 topological insulator at finite temperature. For the 2D correlated Chern insulator, we found that thermal-fluctuation-induced magnetic order appears in the intermediate interaction region of the correlated Chern insulator. On the contrary,…
▽ More
In this paper, we study the magnetic orders of two dimensional correlated topological insulators including the correlated Chern insulator and the correlated Z2 topological insulator at finite temperature. For the 2D correlated Chern insulator, we found that thermal-fluctuation-induced magnetic order appears in the intermediate interaction region of the correlated Chern insulator. On the contrary, for the correlated Z2 topological insulator there doesn't exist the thermal-fluctuation-induced magnetic order. In the end, we give an explanation on the difference.
△ Less
Submitted 21 September, 2013;
originally announced September 2013.
-
Preparation of NOON State Induced by Macroscopic Quantum Tunneling in an Ising Chain
Authors:
Chun-Li Zang,
Jing Yu,
Wan-Li Yang,
Mang Feng,
Su-Peng Kou
Abstract:
In this brief report, we propose a possible way, theoretically and experimentally, to generate a NOON state of the two degenerate ferromagnetic ground states of the Transverse Ising Model. In our scheme we employ the macroscopic quantum tunneling (MQT) effect between the two degenerate ferromagnetic ground states to realize the NOON state. Our calculation about the MQT process is based on a higher…
▽ More
In this brief report, we propose a possible way, theoretically and experimentally, to generate a NOON state of the two degenerate ferromagnetic ground states of the Transverse Ising Model. In our scheme we employ the macroscopic quantum tunneling (MQT) effect between the two degenerate ferromagnetic ground states to realize the NOON state. Our calculation about the MQT process is based on a higher-order degenerate perturbation method. After doing a transformation, the MQT process could also be treated as the hopping of individual virtual fermions in the spin chain, which will leads to an analytical description of tunneling process. The experimental feasibility for generating the NOON state is discussed in the setup of linear ion trap.
△ Less
Submitted 25 October, 2012;
originally announced October 2012.
-
Topological superfluid in a fermionic bilayer optical lattice
Authors:
Ya-Jie Wu,
Jing He,
Chun-Li Zang,
Su-Peng Kou
Abstract:
In this paper, a topological superfluid phase with Chern number C=1 possessing gapless edge states and non-Abelian anyons is designed in a C=1 topological insulator proximity to an s-wave superfluid on an optical lattice with the effective gauge field and layer-dependent Zeeman field coupled to ultracold fermionic atoms pseudo spin. We also study its topological properties and calculate the phase…
▽ More
In this paper, a topological superfluid phase with Chern number C=1 possessing gapless edge states and non-Abelian anyons is designed in a C=1 topological insulator proximity to an s-wave superfluid on an optical lattice with the effective gauge field and layer-dependent Zeeman field coupled to ultracold fermionic atoms pseudo spin. We also study its topological properties and calculate the phase stiffness by using the random-phase-approximation approach. Finally we derive the temperature of the Kosterlitz-Thouless transition by means of renormalized group theory. Owning to the existence of non-Abelian anyons, this C=1 topological superfluid may be a possible candidate for topological quantum computation.
△ Less
Submitted 24 September, 2012;
originally announced September 2012.
-
Topological Superfluid in P-band Optical Lattice
Authors:
Ya-Jie Wu,
Jing He,
Chun-Li Zang,
Su-Peng Kou
Abstract:
In this paper by studying p-band fermionic system with nearest neighbor attractive interaction we find translation symmetry protected Z2 topological superfluid (TSF) that is characterized by a special fermion parity pattern at high symmetry points in momentum space k=(0,0), (0,pi), (pi,0), (pi,pi). Such Z2 TSF supports the robust Majorana edge modes and a new type of low energy excitation - (super…
▽ More
In this paper by studying p-band fermionic system with nearest neighbor attractive interaction we find translation symmetry protected Z2 topological superfluid (TSF) that is characterized by a special fermion parity pattern at high symmetry points in momentum space k=(0,0), (0,pi), (pi,0), (pi,pi). Such Z2 TSF supports the robust Majorana edge modes and a new type of low energy excitation - (supersymmetric) Z2 link-excitation. In the end we address a possible realization of such interacting p-band fermions with Z2 TSF.
△ Less
Submitted 22 October, 2011;
originally announced October 2011.