-
A Robust Optimization Model for Cost-Efficient and Fast Electric Vehicle Charging with L2-norm Uncertainty
Authors:
Trung Duc Tran,
Ngoc-Doanh Nguyen,
Hong T. M. Chu,
Laurent El Ghaoui,
Luca Ambrosino,
Giuseppe Calafiore
Abstract:
In this paper, we propose a robust optimization model that addresses both the cost-efficiency and fast charging requirements for electric vehicles (EVs) at charging stations. By combining elements from traditional cost-minimization models and a fast charging objective, we construct an optimization model that balances user costs with rapid power allocation. Additionally, we incorporate L2-norm uncertainty into the charging cost, ensuring that the model remains resilient under cost fluctuations. The proposed model is tested under real-world scenarios and demonstrates its potential for efficient and flexible EV charging solutions.
Submitted 6 February, 2025;
originally announced February 2025.
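As a sketch of how L2-norm uncertainty in a linear charging cost is usually handled, the worst case over a Euclidean ball has a closed form. The symbols below are illustrative assumptions and not the paper's notation: $\bar{c}$ is a nominal cost vector, $P$ a matrix shaping the uncertainty, $\rho$ the uncertainty radius, and $x$ the charging decision.

```latex
\max_{\|u\|_2 \le \rho} \; (\bar{c} + P u)^{\top} x
  \;=\; \bar{c}^{\top} x + \rho \, \| P^{\top} x \|_2
```

The identity follows from the Cauchy-Schwarz inequality, so minimizing the worst-case cost under these assumptions becomes a second-order cone problem.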
-
AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR
Authors:
The Chuong Chu,
Vu Tuan Dat Pham,
Kien Dao,
Hoang Nguyen,
Quoc Hung Truong
Abstract:
Intra-sentential code-switching (CS) refers to the alternation between languages that happens within a single utterance and is a significant challenge for Automatic Speech Recognition (ASR) systems, for example when a Vietnamese speaker uses foreign proper names or specialized terms within their speech. ASR systems often struggle to accurately transcribe intra-sentential CS due to their training on monolingual data and the unpredictable nature of CS. This issue is even more pronounced for low-resource languages, where limited data availability hinders the development of robust models. In this study, we propose AdaCS, a normalization model that integrates an adaptive bias attention module (BAM) into an encoder-decoder network. This novel approach provides a robust solution to CS ASR in unseen domains. By utilizing BAM to both identify and normalize CS phrases, AdaCS enhances its adaptive capabilities with a biased list of words provided during inference. Our method demonstrates impressive performance and the ability to handle unseen CS phrases across various domains. Experiments show that AdaCS outperforms the previous state-of-the-art method on Vietnamese CS ASR normalization, with considerable WER reductions of 56.2% and 36.8% on the two proposed test sets.
Submitted 13 January, 2025;
originally announced January 2025.
-
Deep Multimodal Fusion for Surgical Feedback Classification
Authors:
Rafal Kocielnik,
Elyssa Y. Wong,
Timothy N. Chu,
Lydia Lin,
De-An Huang,
Jiayun Wang,
Anima Anandkumar,
Andrew J. Hung
Abstract:
Quantification of real-time informal feedback delivered by an experienced surgeon to a trainee during surgery is important for skill improvements in surgical training. Such feedback in the live operating room is inherently multimodal, consisting of verbal conversations (e.g., questions and answers) as well as non-verbal elements (e.g., through visual cues like pointing to anatomic elements). In this work, we leverage a clinically-validated five-category classification of surgical feedback: "Anatomic", "Technical", "Procedural", "Praise" and "Visual Aid". We then develop a multi-label machine learning model to classify these five categories of surgical feedback from inputs of text, audio, and video modalities. The ultimate goal of our work is to help automate the annotation of real-time contextual surgical feedback at scale. Our automated classification of surgical feedback achieves AUCs ranging from 71.5 to 77.6, with the fusion improving performance by 3.1%. We also show that high-quality manual transcriptions of feedback audio from experts improve AUCs to between 76.5 and 96.2, which demonstrates a clear path toward future improvements. Empirically, we find that the staged training strategy, which first pre-trains each modality separately and then trains them jointly, is more effective than training all modalities together. We also present intuitive findings on the importance of modalities for different feedback categories. This work offers an important first look at the feasibility of automated classification of real-world live surgical feedback based on text, audio, and video modalities.
Submitted 5 December, 2023;
originally announced December 2023.
-
Maximizing the performance for microcomb based microwave photonic transversal signal processors
Authors:
Yang Sun,
Jiayang Wu,
Yang Li,
Xingyuan Xu,
Guanghui Ren,
Mengxi Tan,
Sai Tak Chu,
Brent E. Little,
Roberto Morandotti,
Arnan Mitchell,
David J. Moss
Abstract:
Microwave photonic (MWP) transversal signal processors offer a compelling solution for realizing versatile high-speed information processing by combining the advantages of reconfigurable electrical digital signal processing and high-bandwidth photonic processing. With the capability of generating a number of discrete wavelengths from micro-scale resonators, optical microcombs are powerful multi-wavelength sources for implementing MWP transversal signal processors with significantly reduced size, power consumption, and complexity. By using microcomb-based MWP transversal signal processors, a diverse range of signal processing functions have been demonstrated recently. In this paper, we provide a detailed analysis of the processing inaccuracy that is induced by the imperfect response of experimental components. First, we investigate the errors arising from different sources including imperfections in the microcombs, the chirp of electro-optic modulators, chromatic dispersion of the dispersive module, shaping errors of the optical spectral shapers, and noise of the photodetector. Next, we provide a global picture quantifying the impact of different error sources on the overall system performance. Finally, we introduce feedback control to compensate for the errors caused by experimental imperfections and achieve significantly improved accuracy. These results provide a guide for optimizing the accuracy of microcomb-based MWP transversal signal processors.
Submitted 10 September, 2023;
originally announced September 2023.
-
Quotients of probabilistic Boolean networks
Authors:
Rui Li,
Qi Zhang,
Tianguang Chu
Abstract:
A probabilistic Boolean network (PBN) is a discrete-time system composed of a collection of Boolean networks between which the PBN switches in a stochastic manner. This paper focuses on the study of quotients of PBNs. Given a PBN and an equivalence relation on its state set, we consider a probabilistic transition system that is generated by the PBN; the resulting quotient transition system then automatically captures the quotient behavior of this PBN. We therefore describe a method for obtaining a probabilistic Boolean system that generates the transitions of the quotient transition system. Applications of this quotient description are discussed, and it is shown that for PBNs, controller synthesis can be performed easily by first controlling a quotient system and then lifting the control law back to the original network. A biological example is given to show the usefulness of the developed results.
Submitted 30 July, 2021;
originally announced July 2021.
-
Towards the Objective Speech Assessment of Smoking Status based on Voice Features: A Review of the Literature
Authors:
Zhizhong Ma,
Chris Bullen,
Joanna Ting Wai Chu,
Ruili Wang,
Yingchun Wang,
Satwinder Singh
Abstract:
In smoking cessation clinical research and practice, objective validation of self-reported smoking status is crucial for ensuring the reliability of the primary outcome, that is, smoking abstinence. Speech signals convey important information about a speaker, such as age, gender, body size, emotional state, and health state. We investigated (1) if smoking could measurably alter voice features, (2) if smoking cessation could lead to changes in voice, and therefore (3) if the voice-based smoking status assessment has the potential to be used as an objective smoking cessation validation method.
Submitted 15 June, 2021;
originally announced June 2021.
-
PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control
Authors:
Dong Chen,
Kaian Chen,
Zhaojian Li,
Tianshu Chu,
Rui Yao,
Feng Qiu,
Kaixiang Lin
Abstract:
This paper develops an efficient multi-agent deep reinforcement learning algorithm for cooperative control in power grids. Specifically, we consider the decentralized inverter-based secondary voltage control problem in distributed generators (DGs), which is first formulated as a cooperative multi-agent reinforcement learning (MARL) problem. We then propose a novel on-policy MARL algorithm, PowerNet, in which each agent (DG) learns a control policy based on (sub-)global reward but local states from its neighboring agents. Motivated by the fact that a local control from one agent has limited impact on agents distant from it, we exploit a novel spatial discount factor to reduce the effect from remote agents, to expedite the training process and improve scalability. Furthermore, a differentiable, learning-based communication protocol is employed to foster collaboration among neighboring agents. In addition, to mitigate the effects of system uncertainty and random noise introduced during on-policy learning, we utilize an action smoothing factor to stabilize the policy execution. To facilitate training and evaluation, we develop PGSim, an efficient, high-fidelity power grid simulation platform. Experimental results in two microgrid setups show that the developed PowerNet outperforms a conventional model-based control, as well as several state-of-the-art MARL algorithms. The decentralized learning scheme and high sample efficiency also make it viable for large-scale power grids.
Submitted 31 July, 2021; v1 submitted 24 November, 2020;
originally announced November 2020.
-
On quotients of Boolean control networks
Authors:
Rui Li,
Qi Zhang,
Tianguang Chu
Abstract:
In this paper, we focus on the study of quotients of Boolean control networks (BCNs) with the motivation that they might serve as smaller models that still carry enough information about the original network. Given a BCN and an equivalence relation on the state set, we consider a labeled transition system that is generated by the BCN. The resulting quotient transition system then naturally captures the quotient dynamics of the BCN concerned. We therefore develop a method for constructing a Boolean system that behaves equivalently to the resulting quotient transition system. The use of the obtained quotient system for control design is discussed and we show that for BCNs, controller synthesis can be done by first designing a controller for a quotient and subsequently lifting it to the original model. We finally demonstrate the applicability of the proposed techniques on a biological example.
Submitted 10 July, 2020;
originally announced July 2020.
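The idea of a quotient transition system generated by a Boolean control network can be illustrated with a small sketch. Everything here is an assumption made for illustration: the two-bit network in `step`, the equivalence relation in `block_of`, and all function names are hypothetical, and the code only builds the quotient transition system; it does not reproduce the authors' method for constructing an equivalent Boolean system.

```python
from itertools import product

def transitions(n_state_bits, n_input_bits, step):
    """All labeled transitions (x, u, step(x, u)) of a Boolean control network."""
    states = list(product((0, 1), repeat=n_state_bits))
    inputs = list(product((0, 1), repeat=n_input_bits))
    return {(x, u, step(x, u)) for x in states for u in inputs}

def quotient(trans, block_of):
    """Quotient transition system: ([x], u, [x']) for every transition (x, u, x')."""
    return {(block_of(x), u, block_of(y)) for (x, u, y) in trans}

def is_deterministic(qtrans):
    """True if every (block, input) pair has a single successor block."""
    seen = {}
    for b, u, b_next in qtrans:
        if seen.setdefault((b, u), b_next) != b_next:
            return False
    return True

# Hypothetical 2-bit network: x1' = x1 XOR u, x2' = x1 AND x2.
def step(x, u):
    x1, x2 = x
    return (x1 ^ u[0], x1 & x2)

# Hypothetical equivalence relation: two states are equivalent if they agree on x1.
def block_of(x):
    return x[0]

trans = transitions(2, 1, step)
q = quotient(trans, block_of)
```

With this choice of relation the quotient is deterministic, because the update of `x1` depends only on `x1` and the input. Grouping states by `x2` instead gives a non-deterministic quotient, which is the kind of case where a quotient carries less precise information about the original network.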
-
Photonic RF channelizer based on a 90 wavelength optical soliton crystal 49GHz Kerr microcomb
Authors:
Xingyuan Xu,
Mengxi Tan,
Jiayang Wu,
Andreas Boes,
Thach G. Nguyen,
Sai T. Chu,
Brent E. Little,
Roberto Morandotti,
Arnan Mitchell,
David J. Moss
Abstract:
We report a broadband radio frequency (RF) channelizer with up to 92 channels using a coherent microcomb source. A soliton crystal microcomb, generated by a 49 GHz micro-ring resonator (MRR), is used as a multi-wavelength source. Due to its ultra-low comb spacing, up to 92 wavelengths are available in the C band, yielding a broad operation bandwidth. Another high-Q MRR is employed as a passive optical periodic filter to slice the RF spectrum with a high resolution of 121.4 MHz. We experimentally achieve an instantaneous RF operation bandwidth of 8.08 GHz and verify RF channelization up to 17.55 GHz via thermal tuning. Our approach is a significant step towards the monolithically integrated photonic RF receivers with reduced complexity, size, and unprecedented performance, which is important for wide RF applications ranging from broadband analog signal processing to digital-compatible signal detection.
Submitted 20 April, 2020;
originally announced May 2020.
-
6G White Paper on Machine Learning in Wireless Communication Networks
Authors:
Samad Ali,
Walid Saad,
Nandana Rajatheva,
Kapseok Chang,
Daniel Steinbach,
Benjamin Sliwa,
Christian Wietfeld,
Kai Mei,
Hamid Shiri,
Hans-Jürgen Zepernick,
Thi My Chinh Chu,
Ijaz Ahmad,
Jyrki Huusko,
Jaakko Suutala,
Shubhangi Bhadauria,
Vimal Bhatia,
Rangeet Mitra,
Saidhiraj Amuru,
Robert Abbas,
Baohua Shao,
Michele Capobianco,
Guanghui Yu,
Maelick Claes,
Teemu Karvonen,
Mingzhe Chen,
et al. (2 additional authors not shown)
Abstract:
The focus of this white paper is on machine learning (ML) in wireless communications. 6G wireless communication networks will be the backbone of the digital transformation of societies by providing ubiquitous, reliable, and near-instant wireless connectivity for humans and machines. Recent advances in ML research have enabled a wide range of novel technologies such as self-driving vehicles and voice assistants. Such innovation is possible as a result of the availability of advanced ML models, large datasets, and high computational power. On the other hand, the ever-increasing demand for connectivity will require a lot of innovation in 6G wireless networks, and ML tools will play a major role in solving problems in the wireless domain. In this paper, we provide an overview of the vision of how ML will impact wireless communication systems. We first give an overview of the ML methods that have the highest potential to be used in wireless networks. Then, we discuss the problems that can be solved by using ML in various layers of the network such as the physical layer, medium access layer, and application layer. Zero-touch optimization of wireless networks using ML is another interesting aspect that is discussed in this paper. Finally, at the end of each section, important research questions that the section aims to answer are presented.
Submitted 28 April, 2020;
originally announced April 2020.
-
Microwave photonic fractional Hilbert transformer with an integrated optical soliton crystal micro-comb
Authors:
Mengxi Tan,
Xingyuan Xu,
Bill Corcoran,
Jiayang Wu,
Andreas Boes,
Thach G. Nguyen,
Sai T. Chu,
Brent E. Little,
Roberto Morandotti,
Arnan Mitchell,
David J. Moss
Abstract:
We report a photonic microwave and RF fractional Hilbert transformer based on an integrated Kerr micro-comb source. The micro-comb source has a free spectral range (FSR) of 50GHz, generating a large number of comb lines that serve as a high-performance multi-wavelength source for the transformer. By programming and shaping the comb lines according to calculated tap weights, we achieve both arbitrary fractional orders and a broad operation bandwidth. We experimentally characterize the RF amplitude and phase response for different fractional orders and perform system demonstrations of real-time fractional Hilbert transforms. We achieve a phase ripple of < 0.15 rad within the 3-dB pass-band, with bandwidths ranging from 5 to 9 octaves, depending on the order. The experimental results show good agreement with theory, confirming the effectiveness of our approach as a new way to implement high-performance fractional Hilbert transformers with broad processing bandwidth, high reconfigurability, and greatly reduced size and complexity.
Submitted 8 October, 2019;
originally announced October 2019.
-
Photonic single sideband RF generator based on an integrated optical micro-ring resonator
Authors:
Xingyuan Xu,
Jiayang Wu,
Mengxi Tan,
Thach G. Nguyen,
Sai T. Chu,
Brent E. Little,
Roberto Morandotti,
Arnan Mitchell,
David J. Moss
Abstract:
We demonstrate narrowband orthogonally polarized optical RF single sideband generation as well as dual-channel equalization based on an integrated dual-polarization-mode high-Q microring resonator. The device operates in the optical communications band and enables narrowband RF operation at either 16.6 GHz or 32.2 GHz, determined by the free spectral range and TE/TM mode interval in the resonator. We achieve a very large dynamic tuning range of over 55 dB for both the optical carrier-to-sideband ratio and the dual-channel RF equalization.
Submitted 7 August, 2018;
originally announced August 2018.
-
Optical wavelength conversion of high bandwidth phase-encoded signals in a high FOM 50cm CMOS compatible waveguide
Authors:
Francesco Da Ros,
Edson Porto da Silva,
Darko Zibar,
Sai T. Chu,
Brent E. Little,
Roberto Morandotti,
Michael Galili,
David J. Moss,
Leif K. Oxenløwe
Abstract:
We demonstrate wavelength conversion of QAM signals including 32GBd QPSK and 10GBd 16QAM in a 50cm long high index doped glass spiral waveguide. The quality of the generated idlers over a 10nm bandwidth is sufficient to achieve a BER performance below the HD FEC threshold (less than 3.8 × 10⁻³), with an OSNR penalty of less than 0.3 dB compared to the original signal. Our results confirm that this is a promising platform for nonlinear optical signal processing, a result of both very low linear propagation loss (less than 0.07 dB/cm) and the large material bandgap that ensures negligible nonlinear loss at telecom wavelengths.
Submitted 7 August, 2018;
originally announced August 2018.
-
High-order Radio Frequency Differentiation via Photonic Signal Processing with an Integrated Micro-resonator Kerr Optical Frequency Comb Source
Authors:
Xingyuan Xu,
Jiayang Wu,
Mehrdad Shoeiby,
Sai T. Chu,
Brent E. Little,
Roberto Morandotti,
Arnan Mitchell,
David J. Moss
Abstract:
We demonstrate the use of integrated micro-resonator based optical frequency comb sources as the basis for transversal filtering functions for microwave and radio frequency photonic filtering and advanced functions.
Submitted 7 April, 2018;
originally announced April 2018.
-
Cloud Resource Allocation for Cloud-Based Automotive Applications
Authors:
Zhaojian Li,
Tianshu Chu,
Ilya V. Kolmanovsky,
Xiang Yin,
Xunyuan Yin
Abstract:
There is a rapidly growing interest in the use of cloud computing for automotive vehicles to facilitate computation and data intensive tasks. Efficient utilization of on-demand cloud resources holds a significant potential to improve future vehicle safety, comfort, and fuel economy. Meanwhile, issues like cyber security and resource allocation pose great challenges. In this paper, we treat the resource allocation problem for cloud-based automotive systems. Both private and public cloud paradigms are considered, where a private cloud provides an internal, company-owned internet service dedicated to its own vehicles while a public cloud serves all subscribed vehicles. This paper establishes comprehensive models of cloud resource provisioning for both private and public cloud-based automotive systems. Complications such as stochastic communication delays and task deadlines are explicitly considered. In particular, a centralized resource provisioning model is developed for the private cloud, and chance constrained optimization is exploited to utilize the cloud resources for the best Quality of Service. On the other hand, a decentralized auction-based model is developed for the public cloud, and reinforcement learning is employed to obtain an optimal bidding policy for a "selfish" agent. Numerical examples are presented to illustrate the effectiveness of the developed techniques.
Submitted 17 January, 2017;
originally announced January 2017.
-
Automatic Interpretation of Unordered Point Cloud Data for UAV Navigation in Construction
Authors:
M. D. Phung,
C. H. Quach,
D. T. Chu,
N. Q. Nguyen,
T. H. Dinh,
Q. P. Ha
Abstract:
The objective of this work is to develop a data processing system that can automatically generate waypoints for navigation of an unmanned aerial vehicle (UAV) to inspect surfaces of structures like buildings and bridges. The input includes data recorded by two 2D laser scanners, orthogonally mounted on the UAV, and an inertial measurement unit (IMU). To achieve the goal, algorithms are developed to process the data collected. They are separated into three major groups: (i) the data registration and filtering to generate a 3D model of the structure and control the density of point clouds for data completeness enhancement; (ii) the surface and obstacle detection to assist the UAV in monitoring tasks; and (iii) the waypoint generation to set the flight path. Experiments on different data sets show that the developed system is able to reconstruct a 3D point cloud of the structure, extract its surfaces and objects, and generate waypoints for the UAV to accomplish inspection tasks.
Submitted 12 February, 2017; v1 submitted 22 December, 2016;
originally announced December 2016.