Search | arXiv e-print repository

doi 10.1145/3716863.3718047

Certifying Lyapunov Stability of Black-Box Nonlinear Systems via Counterexample Guided Synthesis (Extended Version)

Authors: Chiao Hsieh, Masaki Waga, Kohei Suenaga

Abstract: Finding Lyapunov functions to certify the stability of control systems has been an important topic for verifying safety-critical systems. Most existing methods for finding Lyapunov functions require access to the dynamics of the system. Accurately describing the complete dynamics of a control system, however, remains highly challenging in practice. The latest trend of using learning-enabled control systems further reduces transparency. Hence, a method for black-box systems would have much wider applications. Our work stems from the recent idea of sampling and exploiting Lipschitz continuity to approximate the unknown dynamics. Given Lipschitz constants, one can derive non-statistical upper bounds on approximation errors; hence a strong certification on this approximation can certify the unknown dynamics. We significantly improve this idea by directly approximating the Lie derivative of Lyapunov functions instead of the dynamics. We propose a framework based on the learner-verifier architecture from Counterexample-Guided Inductive Synthesis (CEGIS). Our insight of combining regional verification conditions and counterexample-guided sampling enables a guided search for samples to prove stability region-by-region. Our CEGIS algorithm further ensures termination. Our numerical experiments suggest that it is possible to prove the stability of 2D and 3D systems with a few thousand samples. Our visualization also reveals the regions where the stability is difficult to prove. In comparison with the existing black-box approach, our approach in the best case requires less than 0.01% of samples.

Submitted 15 May, 2025; v1 submitted 1 March, 2025; originally announced March 2025.

Comments: 30 pages, 3 figures. This is the extended version of the same paper published in the 28th International Conference on Hybrid Systems: Computation and Control (HSCC 2025). Add acknowledgements in v2
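
The Lipschitz argument mentioned in the abstract above can be illustrated with a minimal sketch. It is not the authors' algorithm; it only shows the basic bound, under the assumption that a scalar function g (standing in for the Lie derivative of a candidate Lyapunov function) is L-Lipschitz and has been sampled at one point. The function names `certified_negative` and `max_certified_radius` are hypothetical.

```python
import math


def certified_negative(value, lipschitz, radius):
    """Check whether an L-Lipschitz function is provably negative on a ball.

    If g(c) = value and g is L-Lipschitz, then for every x with
    ||x - c|| <= radius we have g(x) <= value + lipschitz * radius.
    A negative upper bound therefore certifies g < 0 on the whole ball.
    """
    return value + lipschitz * radius < 0.0


def max_certified_radius(value, lipschitz):
    """Supremum of radii r for which value + lipschitz * r < 0 holds."""
    if value >= 0.0:
        return 0.0  # the sample itself is not negative; nothing to certify
    if lipschitz == 0.0:
        return math.inf  # g is constant, so it is negative everywhere
    return -value / lipschitz


# Hypothetical sample: g(c) = -0.5 with Lipschitz constant L = 2.0.
print(certified_negative(-0.5, 2.0, 0.2))  # bound is -0.1, certified
print(certified_negative(-0.5, 2.0, 0.3))  # bound is +0.1, not certified
print(max_certified_radius(-0.5, 2.0))
```

A sample whose value is close to zero certifies only a small ball, which gives some intuition for why regions near the boundary of negativity need many more samples.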

arXiv:2501.18453 [pdf, other]

Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images

Authors: Wei-Lun Chen, Chia-Yeh Hsieh, Yu-Hsiang Kao, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao

Abstract: This study presents a novel approach to human keypoint detection in low-resolution thermal images using transfer learning techniques. We introduce the first application of the Timed Up and Go (TUG) test in thermal image computer vision, establishing a new paradigm for mobility assessment. Our method leverages a MobileNetV3-Small encoder and a ViTPose decoder, trained using a composite loss function that balances latent representation alignment and heatmap accuracy. The model was evaluated using the Object Keypoint Similarity (OKS) metric from the COCO Keypoint Detection Challenge. The proposed model achieves better performance with AP, AP50, and AP75 scores of 0.861, 0.942, and 0.887 respectively, outperforming traditional supervised learning approaches like Mask R-CNN and ViTPose-Base. Moreover, our model demonstrates superior computational efficiency in terms of parameter count and FLOPS. This research lays a solid foundation for future clinical applications of thermal imaging in mobility assessment and rehabilitation monitoring.

Submitted 30 January, 2025; originally announced January 2025.

Comments: Accepted to AICAS 2025. This is the preprint version
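
For readers unfamiliar with the OKS metric named in the abstract above, the following is a minimal sketch of the per-instance score as it is commonly stated for COCO keypoint evaluation: a Gaussian of the keypoint distance, scaled by the object area and a per-keypoint constant, averaged over labeled keypoints. The function name, argument layout, and example values are assumptions for illustration and are not taken from the paper.

```python
import math


def object_keypoint_similarity(pred, truth, visibility, area, kappas):
    """Average per-keypoint similarity over the labeled keypoints.

    pred, truth : lists of (x, y) coordinates
    visibility  : list of flags; a keypoint counts only when its flag is > 0
    area        : object scale squared (s^2 in the usual formula)
    kappas      : per-keypoint constants k_i controlling the falloff
    """
    total, count = 0.0, 0
    for (px, py), (tx, ty), v, k in zip(pred, truth, visibility, kappas):
        if v <= 0:
            continue  # unlabeled keypoints do not contribute
        d2 = (px - tx) ** 2 + (py - ty) ** 2
        total += math.exp(-d2 / (2.0 * area * k ** 2))
        count += 1
    return total / count if count else 0.0


# Hypothetical example: two labeled keypoints, one unlabeled.
truth = [(10.0, 10.0), (20.0, 20.0), (0.0, 0.0)]
pred = [(10.0, 10.0), (21.0, 21.0), (5.0, 5.0)]
print(object_keypoint_similarity(pred, truth, [2, 1, 0], 100.0, [0.1, 0.1, 0.1]))
```

The AP, AP50, and AP75 figures quoted in the abstract are obtained by thresholding scores of this kind, in the same way that box IoU is thresholded in object detection.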

arXiv:2411.18235 [pdf, other]

Certified Training with Branch-and-Bound: A Case Study on Lyapunov-stable Neural Control

Authors: Zhouxing Shi, Cho-Jui Hsieh, Huan Zhang

Abstract: We study the problem of learning Lyapunov-stable neural controllers which provably satisfy the Lyapunov asymptotic stability condition within a region-of-attraction. Compared to previous works which commonly used counterexample guided training on this task, we develop a new and generally formulated certified training framework named CT-BaB, and we optimize for differentiable verified bounds, to produce verification-friendly models. In order to handle the relatively large region-of-interest, we propose a novel framework of training-time branch-and-bound to dynamically maintain a training dataset of subregions throughout training, such that the hardest subregions are iteratively split into smaller ones whose verified bounds can be computed more tightly to ease the training. We demonstrate that our new training framework can produce models which can be more efficiently verified at test time. On the largest 2D quadrotor dynamical system, verification for our model is more than 5X faster compared to the baseline, while our size of region-of-attraction is 16X larger than the baseline.

Submitted 27 November, 2024; originally announced November 2024.

Comments: Preprint

arXiv:2411.05361 [pdf, ps, other]

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Authors: Chien-yu Huang, Wei-Chih Chen, Shu-wen Yang, Andy T. Liu, Chen-An Li, Yu-Xiang Lin, Wei-Cheng Tseng, Anuj Diwan, Yi-Jen Shih, Jiatong Shi, William Chen, Chih-Kai Yang, Wenze Ren, Xuanjun Chen, Chi-Yuan Hsiao, Puyuan Peng, Shih-Heng Wang, Chun-Yi Kuan, Ke-Han Lu, Kai-Wei Chang, Fabian Ritter-Gutierrez, Kuan-Po Huang, Siddhant Arora, You-Kuan Lin, Ming To Chuang, et al. (55 additional authors not shown)

Abstract: Multimodal foundation models, such as Gemini and ChatGPT, have revolutionized human-machine interactions by seamlessly integrating various forms of data. Developing a universal spoken language model that comprehends a wide range of natural language instructions is critical for bridging communication gaps and facilitating more intuitive interactions. However, the absence of a comprehensive evaluation benchmark poses a significant challenge. We present Dynamic-SUPERB Phase-2, an open and evolving benchmark for the comprehensive evaluation of instruction-based universal speech models. Building upon the first generation, this second version incorporates 125 new tasks contributed collaboratively by the global research community, expanding the benchmark to a total of 180 tasks, making it the largest benchmark for speech and audio evaluation. While the first generation of Dynamic-SUPERB was limited to classification tasks, Dynamic-SUPERB Phase-2 broadens its evaluation capabilities by introducing a wide array of novel and diverse tasks, including regression and sequence generation, across speech, music, and environmental audio. Evaluation results show that no model performed well universally. SALMONN-13B excelled in English ASR and Qwen2-Audio-7B-Instruct showed high accuracy in emotion recognition, but current models still require further innovations to handle a broader range of tasks. We open-source all task data and the evaluation pipeline at https://github.com/dynamic-superb/dynamic-superb.

Submitted 9 June, 2025; v1 submitted 8 November, 2024; originally announced November 2024.

Comments: ICLR 2025

arXiv:2410.23536 [pdf, ps, other]

On Cost-Sensitive Distributionally Robust Log-Optimal Portfolio

Authors: Chung-Han Hsieh, Xiao-Rou Yu

Abstract: This paper addresses a novel cost-sensitive distributionally robust log-optimal portfolio problem, where the investor faces ambiguous return distributions, and a general convex transaction cost model is incorporated. The uncertainty in the return distribution is quantified using the Wasserstein metric, which captures distributional ambiguity. We establish conditions that ensure robustly survivable trades for all distributions in the Wasserstein ball under convex transaction costs. By leveraging duality theory, we approximate the infinite-dimensional distributionally robust optimization problem with a finite convex program, enabling computational tractability for mid-sized portfolios. Empirical studies using S&P 500 data validate our theoretical framework: without transaction costs, the optimal portfolio converges to an equal-weighted allocation, while with transaction costs, the portfolio shifts slightly towards the risk-free asset, reflecting the trade-off between cost considerations and optimal allocation.

Submitted 30 October, 2024; originally announced October 2024.

Comments: Submitted for possible publication

MSC Class: 91G10; 93E03; 90C17; 90C46; 90C25

arXiv:2408.07879 [pdf, ps, other]

On Accelerating Large-Scale Robust Portfolio Optimization

Authors: Chung-Han Hsieh, Jie-Ling Lu

Abstract: Solving large-scale robust portfolio optimization problems is challenging due to the high computational demands associated with an increasing number of assets, the amount of data considered, and market uncertainty. To address this issue, we propose an extended supporting hyperplane approximation approach for efficiently solving a class of distributionally robust portfolio problems for a general class of additively separable utility functions and polyhedral ambiguity distribution set, applied to a large-scale set of assets. Our technique is validated using a large-scale portfolio of the S&P 500 index constituents, demonstrating robust out-of-sample trading performance. More importantly, our empirical studies show that this approach significantly reduces computational time compared to traditional concave Expected Log-Growth (ELG) optimization, with running times decreasing from several thousand seconds to just a few. This method provides a scalable and practical solution to large-scale robust portfolio optimization, addressing both theoretical and practical challenges.

Submitted 14 August, 2024; originally announced August 2024.

Comments: Submitted to possible publication

MSC Class: 91G10; 90C17; 90C15

arXiv:2408.01951 [pdf, other]

Harmonic MUSIC Method for mmWave Radar-based Vital Sign Estimation

Authors: Chieh-Hsun Hsieh, Tung-Lin Tsai, Po-Hsuan Tseng

Abstract: This paper investigates the application of millimeter-wave (mmWave) radar for the estimation of human vital signs. Aiming to obtain more accurate frequency estimation for periodic signals of respiration and heartbeat, we propose the harmonic MUSIC (HMUSIC) algorithm to consider harmonic components for frequency estimation of vital sign signals. In the experiments, we tested different subjects' vital signs. Experimental results demonstrate that the 89-th percentile errors in respiration rate and the 88-th percentile errors in heartbeat rate are less than 3 respirations per minute and 5 beats per minute.

Submitted 4 August, 2024; originally announced August 2024.

arXiv:2404.13371 [pdf, ps, other]

On Risk-Sensitive Decision Making Under Uncertainty

Authors: Chung-Han Hsieh, Yi-Shan Wong

Abstract: This paper studies a risk-sensitive decision-making problem under uncertainty. It considers a decision-making process that unfolds over a fixed number of stages, in which a decision-maker chooses among multiple alternatives, some of which are deterministic and others are stochastic. The decision-maker's cumulative value is updated at each stage, reflecting the outcomes of the chosen alternatives. After formulating this as a stochastic control problem, we delineate the necessary optimality conditions for it. Two illustrative examples from optimal betting and inventory management are provided to support our theory.

Submitted 20 April, 2024; originally announced April 2024.

Comments: submitted for possible publication

arXiv:2404.07956 [pdf, other]

Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation

Authors: Lujie Yang, Hongkai Dai, Zhouxing Shi, Cho-Jui Hsieh, Russ Tedrake, Huan Zhang

Abstract: Learning-based neural network (NN) control policies have shown impressive empirical performance in a wide range of tasks in robotics and control. However, formal (Lyapunov) stability guarantees over the region-of-attraction (ROA) for NN controllers with nonlinear dynamical systems are challenging to obtain, and most existing approaches rely on expensive solvers such as sums-of-squares (SOS), mixed-integer programming (MIP), or satisfiability modulo theories (SMT). In this paper, we demonstrate a new framework for learning NN controllers together with Lyapunov certificates using fast empirical falsification and strategic regularizations. We propose a novel formulation that defines a larger verifiable region-of-attraction (ROA) than shown in the literature, and refines the conventional restrictive constraints on Lyapunov derivatives to focus only on certifiable ROAs. The Lyapunov condition is rigorously verified post-hoc using branch-and-bound with scalable linear bound propagation-based NN verification techniques. The approach is efficient and flexible, and the full training and verification procedure is accelerated on GPUs without relying on expensive solvers for SOS, MIP, or SMT. The flexibility and efficiency of our framework allow us to demonstrate Lyapunov-stable output feedback control with synthesized NN-based controllers and NN-based observers with formal stability guarantees, for the first time in the literature. Source code at https://github.com/Verified-Intelligence/Lyapunov_Stable_NN_Controllers

Submitted 4 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

Comments: Paper accepted by ICML 2024

arXiv:2312.14934 [pdf]

aoip.ai: An Open-Source P2P SDK

Authors: Joseph Konan, Shikhar Agnihotri, Chia-Chun Hsieh

Abstract: This white paper introduces aoip.ai, a groundbreaking open-source SDK incorporating peer-to-peer technology and advanced AI integration to transform VoIP and IoT applications. It addresses key market challenges by enhancing data security, elevating communication quality, and providing greater flexibility for developers and users. Developed in collaboration with Carnegie Mellon University, aoip.ai sets a new standard for decentralized and democratized communication solutions.

Submitted 1 December, 2023; originally announced December 2023.

arXiv:2304.06335 [pdf]

Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks

Authors: Chien-Pin Liu, Ju-Hsuan Li, En-Ping Chu, Chia-Yeh Hsieh, Kai-Chun Liu, Chia-Tai Chan, Yu Tsao

Abstract: Falls are a public health issue for the elderly all over the world, since fall-induced injuries are associated with a large amount of healthcare cost. Falls can cause serious injuries, even leading to death if an elderly person suffers a "long-lie". Hence, a reliable fall detection (FD) system is required to provide an emergency alarm for first aid. Due to the advances in wearable device technology and artificial intelligence, some fall detection systems have been developed using machine learning and deep learning methods to analyze the signals collected from accelerometers and gyroscopes. In order to achieve better fall detection performance, an ensemble model that combines a coarse-fine convolutional neural network and gated recurrent unit is proposed in this study. The parallel structure design used in this model restores the different grains of spatial characteristics and captures temporal dependencies for feature representation. This study applies the FallAllD public dataset to validate the reliability of the proposed model, which achieves a recall, precision, and F-score of 92.54%, 96.13%, and 94.26%, respectively. The results demonstrate the reliability of the proposed ensemble model in discriminating falls from daily living activities and its superior performance compared to the state-of-the-art convolutional neural network long short-term memory (CNN-LSTM) for FD.

Submitted 13 April, 2023; originally announced April 2023.

arXiv:2303.10806 [pdf, ps, other]

On Robustness of Double Linear Policy with Time-Varying Weights

Authors: Xin-Yu Wang, Chung-Han Hsieh

Abstract: In this paper, we extend the existing double linear policy by incorporating time-varying weights instead of constant weights and study a certain robustness property, called robust positive expectation (RPE), in a discrete-time setting. We prove that the RPE property holds by employing a novel elementary symmetric polynomials characterization approach and derive an explicit expression for both the expected cumulative gain-loss function and its variance. To validate our theory, we perform extensive Monte Carlo simulations using various weighting functions. Furthermore, we demonstrate how this policy can be effectively incorporated with standard technical analysis techniques, using the moving average as a trading signal.

Submitted 19 March, 2023; originally announced March 2023.

Comments: Submitted for possible publication

MSC Class: 93E03; 93B35; 91-08

Journal ref: Proceedings of the IEEE Conference of Decision and Control (CDC), 2023

arXiv:2303.03634 [pdf]

PreFallKD: Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation

Authors: Tin-Han Chi, Kai-Chun Liu, Chia-Yeh Hsieh, Yu Tsao, Chia-Tai Chan

Abstract: Fall accidents are critical issues in an aging and aged society. Recently, many researchers developed pre-impact fall detection systems using deep learning to support wearable-based fall protection systems for preventing severe injuries. However, most works only employed simple neural network models instead of complex models considering the usability in resource-constrained mobile devices and strict latency requirements. In this work, we propose a novel pre-impact fall detection via CNN-ViT knowledge distillation, namely PreFallKD, to strike a balance between detection performance and computational complexity. The proposed PreFallKD transfers the detection knowledge from the pre-trained teacher model (vision transformer) to the student model (lightweight convolutional neural networks). Additionally, we apply data augmentation techniques to tackle issues of data imbalance. We conduct the experiment on the KFall public dataset and compare PreFallKD with other state-of-the-art models. The experiment results show that PreFallKD could boost the student model during the testing phase and achieves reliable F1-score (92.66%) and lead time (551.3 ms).

Submitted 28 March, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

arXiv:2302.13390 [pdf, other]

MDF-Net for abnormality detection by fusing X-rays with clinical data

Authors: Chihcheng Hsieh, Isabel Blanco Nobre, Sandra Costa Sousa, Chun Ouyang, Margot Brereton, Jacinto C. Nascimento, Joaquim Jorge, Catarina Moreira

Abstract: This study investigates the effects of including patients' clinical information on the performance of deep learning (DL) classifiers for disease location in chest X-ray images. Although current classifiers achieve high performance using chest X-ray images alone, our interviews with radiologists indicate that clinical data is highly informative and essential for interpreting images and making proper diagnoses. In this work, we propose a novel architecture consisting of two fusion methods that enable the model to simultaneously process patients' clinical data (structured data) and chest X-rays (image data). Since these data modalities are in different dimensional spaces, we propose a spatial arrangement strategy, spatialization, to facilitate the multimodal learning process in a Mask R-CNN model. We performed an extensive experimental evaluation using MIMIC-Eye, a dataset comprising modalities: MIMIC-CXR (chest X-ray images), MIMIC IV-ED (patients' clinical data), and REFLACX (annotations of disease locations in chest X-rays). Results show that incorporating patients' clinical data in a DL model together with the proposed fusion methods improves the disease localization in chest X-rays by 12% in terms of Average Precision compared to a standard Mask R-CNN using only chest X-rays. Further ablation studies also emphasize the importance of multimodal DL architectures and the incorporation of patients' clinical data in disease localization.
The architecture proposed in this work is publicly available to promote the scientific reproducibility of our study (https://github.com/ChihchengHsieh/multimodal-abnormalities-detection)

Submitted 27 December, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

arXiv:2301.12258 [pdf, other]

Cross-domain Neural Pitch and Periodicity Estimation

Authors: Max Morrison, Caedon Hsieh, Nathan Pruyne, Bryan Pardo

Abstract: Pitch is a foundational aspect of our perception of audio signals. Pitch contours are commonly used to analyze speech and music signals and as input features for many audio tasks, including music transcription, singing voice synthesis, and prosody editing. In this paper, we describe a set of techniques for improving the accuracy of widely-used neural pitch and periodicity estimators to achieve state-of-the-art performance on both speech and music. We also introduce a novel entropy-based method for extracting periodicity and per-frame voiced-unvoiced classifications from statistical inference-based pitch estimators (e.g., neural networks), and show how to train a neural pitch estimator to simultaneously handle both speech and music data (i.e., cross-domain estimation) without performance degradation. Our estimator implementations run 11.2x faster than real-time on an Intel i9-9820X 10-core 3.30 GHz CPU, approaching the speed of state-of-the-art DSP-based pitch estimators, or 408x faster than real-time on an NVIDIA GeForce RTX 3090 GPU. We release all of our code and models as Pitch-Estimating Neural Networks (penn), an open-source, pip-installable Python module for training, evaluating, and performing inference with pitch- and periodicity-estimating neural networks. The code for penn is available at https://github.com/interactiveaudiolab/penn.

Submitted 11 August, 2024; v1 submitted 28 January, 2023; originally announced January 2023.

arXiv:2301.02754 [pdf, ps, other]

On Frequency-Based Optimal Portfolio with Transaction Costs

Authors: Chung-Han Hsieh, Yi-Shan Wong

Abstract: The aim of this paper is to investigate the impact of rebalancing frequency and transaction costs on the log-optimal portfolio, which is a portfolio that maximizes the expected logarithmic growth rate of an investor's wealth. We prove that the frequency-dependent log-optimal portfolio problem with costs is equivalent to a concave program and provide a version of the dominance theorem with costs to determine when an investor should invest all available funds in a particular asset. Then, we show that transaction costs may cause a bankruptcy issue for the frequency-dependent log-optimal portfolio. To address this issue, we approximate the problem to obtain a quadratic concave program and derive necessary and sufficient optimality conditions. Additionally, we prove a version of the two-fund theorem, which states that any convex combination of two optimal weights from the optimality conditions is still optimal. We test our proposed methods using both intraday and daily price data. Finally, we extend our empirical studies to an online trading scenario by implementing a sliding window approach. This approach enables us to solve a sequence of concave programs rather than a potentially computationally complex stochastic dynamic programming problem.

Submitted 6 January, 2023; originally announced January 2023.

Comments: Submitted for possible publication

MSC Class: 91B2; 91B32; 91B70
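
The definition of a log-optimal portfolio quoted in the abstract above (the weights that maximize the expected logarithmic growth of wealth) can be made concrete with a toy sketch. This is not the paper's model: it assumes a single risky asset with a discrete return distribution, no transaction costs, and a plain grid search in place of a concave-program solver. The names `expected_log_growth` and `log_optimal_weight` are hypothetical.

```python
import math


def expected_log_growth(weight, outcomes):
    """Expected log growth of wealth when a fraction `weight` is invested.

    outcomes: list of (probability, return) pairs, with returns >= -1.
    """
    return sum(p * math.log(1.0 + weight * r) for p, r in outcomes)


def log_optimal_weight(outcomes, steps=100):
    """Grid search over weights 0, 1/steps, ..., (steps - 1)/steps.

    Weight 1.0 is excluded so that log(1 + weight * r) stays finite
    even when a return of -1 (total loss) is possible.
    """
    candidates = [i / steps for i in range(steps)]
    return max(candidates, key=lambda w: expected_log_growth(w, outcomes))


# Hypothetical even-money bet won with probability 0.6.
bet = [(0.6, 1.0), (0.4, -1.0)]
best = log_optimal_weight(bet)
print(best, expected_log_growth(best, bet))
```

For this even-money bet, the grid search recovers the classical Kelly fraction 2p - 1 = 0.2. Because the objective is concave in the weight, a convex-optimization solver would be the natural replacement for the grid search in higher dimensions.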

arXiv:2211.10354 [pdf, ps, other]

CRONOS: Colorization and Contrastive Learning for Device-Free NLoS Human Presence Detection using Wi-Fi CSI

Authors: Li-Hsiang Shen, Chia-Che Hsieh, An-Hung Hsiao, Kai-Ten Feng

Abstract: In recent years, the demand for pervasive smart services and applications has increased rapidly. Device-free human detection through sensors or cameras has been widely adopted, but it comes with privacy issues as well as misdetection for motionless people. To address these drawbacks, channel state information (CSI) captured from commercialized Wi-Fi devices provides rich signal features for accurate detection. However, existing systems suffer from inaccurate classification under a non-line-of-sight (NLoS) and stationary scenario, such as when a person is standing still in a room corner. In this work, we propose a system called CRONOS (Colorization and Contrastive Learning Enhanced NLoS Human Presence Detection), which generates dynamic recurrence plots (RPs) and color-coded CSI ratios to distinguish mobile and stationary people from vacancy in a room, respectively. We also incorporate supervised contrastive learning to retrieve substantial representations, where consultation loss is formulated to differentiate the representative distances between dynamic and stationary cases. Furthermore, we propose a self-switched static feature enhanced classifier (S3FEC) to determine the utilization of either RPs or color-coded CSI ratios. Our comprehensive experimental results show that CRONOS outperforms existing systems that either apply machine learning or non-learning based methods, as well as non-CSI based features in open literature. CRONOS achieves the highest human presence detection accuracy in vacancy, mobility, line-of-sight (LoS), and NLoS scenarios.

Submitted 16 August, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

Comments: Accepted by IEEE IoT-J

arXiv:2211.00585 [pdf, other]

Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New Speakers

Authors: Cheng-Ping Hsieh, Subhankar Ghosh, Boris Ginsburg

Abstract: Fine-tuning is a popular method for adapting text-to-speech (TTS) models to new speakers. However, this approach has some challenges. Usually fine-tuning requires several hours of high-quality speech per speaker. There is also a risk that fine-tuning will negatively affect the quality of speech synthesis for previously learnt speakers. In this paper we propose an alternative approach for TTS adaptation based on using parameter-efficient adapter modules. In the proposed approach, a few small adapter modules are added to the original network. The original weights are frozen, and only the adapters are fine-tuned on speech for the new speaker. The parameter-efficient fine-tuning approach produces a new model with a high level of parameter sharing with the original model. Our experiments on LibriTTS, HiFi-TTS and VCTK datasets validate the effectiveness of the adapter-based method through objective and subjective metrics.

Submitted 1 November, 2022; originally announced November 2022.

Comments: Submitted to ICASSP 2023

arXiv:2208.02232 [pdf, other]

GAS: Generating Fast and Accurate Surrogate Models for Autonomous Vehicle Systems

Authors: Keyur Joshi, Chiao Hsieh, Sayan Mitra, Sasa Misailovic

Abstract: Modern autonomous vehicle systems use complex perception and control components. These components can rapidly change during development of such systems, requiring constant re-testing. Unfortunately, high-fidelity simulations of these complex systems for evaluating vehicle safety are costly. The complexity also hinders the creation of less computationally intensive surrogate models. We present GAS, the first approach for creating surrogate models of complete (perception, control, and dynamics) autonomous vehicle systems containing complex perception and/or control components. GAS's two-stage approach first replaces complex perception components with a perception model. Then, GAS constructs a polynomial surrogate model of the complete vehicle system using Generalized Polynomial Chaos (GPC). We demonstrate the use of these surrogate models in two applications. First, we estimate the probability that the vehicle will enter an unsafe state over time. Second, we perform global sensitivity analysis of the vehicle system with respect to its state in a previous time step. GAS's approach also allows for reuse of the perception model when vehicle control and dynamics characteristics are altered during vehicle development, saving significant time. We consider five scenarios concerning crop management vehicles that must not crash into adjacent crops, self-driving cars that must stay within their lane, and unmanned aircraft that must avoid collision. Each of the systems in these scenarios contains a complex perception or control component.
Using GAS, we generate surrogate models for these systems, and evaluate the generated models in the applications described above. GAS's surrogate models provide an average speedup of $3.7\times$ for safe state probability estimation (minimum $2.1\times$) and $1.4\times$ for sensitivity analysis (minimum $1.3\times$), while still maintaining high accuracy.

Submitted 13 July, 2023; v1 submitted 3 August, 2022; originally announced August 2022.

arXiv:2206.12148 [pdf, ps, other]

doi 10.1016/j.ifacol.2022.11.098

On Data-Driven Log-Optimal Portfolio: A Sliding Window Approach

Authors: Pei-Ting Wang, Chung-Han Hsieh

Abstract: In this paper, we propose a data-driven sliding window approach to solve a log-optimal portfolio problem. In contrast to many of the existing papers, this approach leads to a trading strategy with time-varying portfolio weights rather than fixed constant weights. We show, by conducting various empirical studies, that the approach possesses a superior trading performance to the classical log-optimal portfolio in the sense of having a higher cumulative rate of returns.

Submitted 24 June, 2022; originally announced June 2022.

Comments: To appear in the IFAC-PapersOnline (25th International Symposium on Mathematical Theory of Network and Systems)

Journal ref: IFAC-PapersOnline, vol. 55, no. 30, pp. 474-479, 2022

arXiv:2202.02300 [pdf, other]

From Semi-Infinite Constraints to Structured Robust Policies: Optimal Gain Selection for Financial Systems

Authors: Chung-Han Hsieh

Abstract: This paper studies the robust optimal gain selection problem for financial trading systems, formulated within a double linear policy framework, which allocates capital across long and short positions. The key objective is to guarantee robust positive expected (RPE) profits uniformly across a range of uncertain market conditions while ensuring risk control. This problem leads to a robust optimization formulation with semi-infinite constraints, where the uncertainty is modeled by a bounded set of possible return parameters. We address this by transforming semi-infinite constraints into structured policies (the balanced policy and the complementary policy), which enable explicit characterization of the optimal solution. Additionally, we propose a novel graphical approach to efficiently solve the robust gain selection problem, drastically reducing computational complexity. Empirical validation on historical stock price data demonstrates superior performance in terms of risk-adjusted returns and downside risk compared to conventional strategies. This framework generalizes classical mean-variance optimization by incorporating robustness considerations, offering a systematic and efficient solution for robust trading under uncertainty.

Submitted 16 January, 2025; v1 submitted 4 February, 2022; originally announced February 2022.

Comments: Submitted for possible publication

MSC Class: 90C17; 93B35; 93E20; 90C34; 90C15; 91-10

arXiv:2201.04065 [pdf, other]

ExBrainable: An Open-Source GUI for CNN-based EEG Decoding and Model Interpretation

Authors: Ya-Lin Huang, Chia-Ying Hsieh, Jian-Xue Huang, Chun-Shu Wei

Abstract: We have developed a graphic user interface (GUI), ExBrainable, dedicated to convolutional neural networks (CNN) model training and visualization in electroencephalography (EEG) decoding. Available functions include model training, evaluation, and parameter visualization in terms of temporal and spatial representations. We demonstrate these functions using a well-studied public dataset of motor-imagery EEG and compare the results with existing knowledge of neuroscience. The primary objective of ExBrainable is to provide a fast, simplified, and user-friendly solution of EEG decoding for investigators across disciplines to leverage cutting-edge methods in brain/neuroscience research.

Submitted 10 January, 2022; originally announced January 2022.

arXiv:2112.09177 [pdf, other]

Coherence Learning using Keypoint-based Pooling Network for Accurately Assessing Radiographic Knee Osteoarthritis

Authors: Kang Zheng, Yirui Wang, Chen-I Hsieh, Le Lu, Jing Xiao, Chang-Fu Kuo, Shun Miao

Abstract: Knee osteoarthritis (OA) is a common degenerative joint disorder that affects a large population of elderly people worldwide. Accurate radiographic assessment of knee OA severity plays a critical role in chronic patient management. Current clinically-adopted knee OA grading systems are observer subjective and suffer from inter-rater disagreements. In this work, we propose a computer-aided diagnosis approach to provide more accurate and consistent assessments of both composite and fine-grained OA grades simultaneously. A novel semi-supervised learning method is presented to exploit the underlying coherence in the composite and fine-grained OA grades by learning from unlabeled data. By representing the grade coherence using the log-probability of a pre-trained Gaussian Mixture Model, we formulate an incoherence loss to incorporate unlabeled data in training. The proposed method also describes a keypoint-based pooling network, where deep image features are pooled from the disease-targeted keypoints (extracted along the knee joint) to provide more aligned and pathologically informative feature representations, for accurate OA grade assessments. The proposed method is comprehensively evaluated on the public Osteoarthritis Initiative (OAI) data, a multi-center ten-year observational study on 4,796 subjects. Experimental results demonstrate that our method leads to significant improvements over previous strong whole image-based deep classification network baselines (like ResNet-50).

Submitted 16 December, 2021; originally announced December 2021.

Comments: extension of RSNA 2020 report "Consistent and Coherent Computer-Aided Knee Osteoarthritis Assessment from Plain Radiographs"

arXiv:2111.13312 [pdf]

doi 10.1109/BHI50953.2021.9508588

Instrumented shoulder functional assessment using inertial measurement units for frozen shoulder

Authors: Ting-Yang Lu, Kai-Chun Liu, Chia-Yeh Hsieh, Chih-Ya Chang, Yu Tsao, Chia-Tai Chan

Abstract: Frozen shoulder (FS) is a shoulder condition that leads to pain and loss of shoulder range of motion. FS patients have difficulties in independently performing daily activities. Inertial measurement units (IMUs) have been developed to objectively measure upper limb range of motion (ROM) and shoulder function. In this work, we propose an IMU-based shoulder functional task assessment with kinematic parameters (e.g., smoothness, power, speed, and duration) in FS patients and analyze the functional performance on complete shoulder tasks and subtasks. Twenty FS patients and twenty healthy subjects were recruited in this study. Five shoulder functional tasks are performed by participants, such as washing hair (WH), washing upper back (WUB), washing lower back (WLB), placing an object on a high shelf (POH), and removing an object from back pocket (ROP). The results demonstrate that the used smoothness features can reflect the differences of movement fluency between FS patients and healthy controls (p < 0.05 and effect size > 0.8). Moreover, features of subtasks provided subtle information related to clinical conditions that have not been revealed in features of a complete task, especially the defined subtask 1 and 2 of each task.

Submitted 25 November, 2021; originally announced November 2021.

Comments: 4 pages, 6 tables, 2 figures, To appear in 2021 IEEE BHI

arXiv:2104.15022 [pdf, other]

Deep Image Destruction: Vulnerability of Deep Image-to-Image Models against Adversarial Attacks

Authors: Jun-Ho Choi, Huan Zhang, Jun-Hyuk Kim, Cho-Jui Hsieh, Jong-Seok Lee

Abstract: Recently, the vulnerability of deep image classification models to adversarial attacks has been investigated. However, such an issue has not been thoroughly studied for image-to-image tasks that take an input image and generate an output image (e.g., colorization, denoising, deblurring, etc.). This paper presents comprehensive investigations into the vulnerability of deep image-to-image models to adversarial attacks. For five popular image-to-image tasks, 16 deep models are analyzed from various standpoints such as output quality degradation due to attacks, transferability of adversarial examples across different tasks, and characteristics of perturbations. We show that unlike image classification tasks, the performance degradation on image-to-image tasks largely differs depending on various factors, e.g., attack methods and task objectives. In addition, we analyze the effectiveness of conventional defense methods used for classification models in improving the robustness of the image-to-image models.

Submitted 28 June, 2022; v1 submitted 30 April, 2021; originally announced April 2021.

Comments: ICPR2022

arXiv:2012.14392 [pdf, other]

Adversarial Machine Learning in Wireless Communications using RF Data: A Review

Authors: Damilola Adesina, Chung-Chu Hsieh, Yalin E. Sagduyu, Lijun Qian

Abstract: Machine learning (ML) provides effective means to learn from spectrum data and solve complex tasks involved in wireless communications. Supported by recent advances in computational resources and algorithmic designs, deep learning (DL) has found success in performing various wireless communication tasks such as signal recognition, spectrum sensing and waveform design. However, ML in general and DL in particular have been found vulnerable to manipulations, thus giving rise to a field of study called adversarial machine learning (AML). Although AML has been extensively studied in other data domains such as computer vision and natural language processing, research for AML in the wireless communications domain is still in its early stage. This paper presents a comprehensive review of the latest research efforts focused on AML in wireless communications while accounting for the unique characteristics of wireless systems. First, the background of AML attacks on deep neural networks is discussed and a taxonomy of AML attack types is provided. Various methods of generating adversarial examples and attack mechanisms are also described. In addition, a holistic survey of existing research on AML attacks for various wireless communication problems, as well as the corresponding defense mechanisms in the wireless domain, is presented. Finally, as new attacks and defense techniques are developed, recent research trends and the overarching future outlook for AML for next-generation wireless communications are discussed.

Submitted 22 August, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

Comments: 17 pages, 3 figures

arXiv:2012.10911 [pdf]

Domain-adaptive Fall Detection Using Deep Adversarial Training

Authors: Kai-Chun Liu, Michael Can, Heng-Cheng Kuo, Chia-Yeh Hsieh, Hsiang-Yun Huang, Chia-Tai Chan, Yu Tsao

Abstract: Fall detection (FD) systems are important assistive technologies for healthcare that can detect emergency fall events and alert caregivers. However, it is not easy to obtain large-scale annotated fall events with various specifications of sensors or sensor positions during the implementation of accurate FD systems. Moreover, the knowledge obtained through machine learning has been restricted to tasks in the same domain. The mismatch between different domains might hinder the performance of FD systems. Cross-domain knowledge transfer is very beneficial for machine-learning-based FD systems to train a reliable FD model with well-labeled data in new environments. In this study, we propose domain-adaptive fall detection (DAFD) using deep adversarial training (DAT) to tackle cross-domain problems, such as cross-position and cross-configuration. The proposed DAFD can transfer knowledge from the source domain to the target domain by minimizing the domain discrepancy to avoid mismatch problems. The experimental results show that the average F1-score improvement when using DAFD ranges from 1.5% to 7% in the cross-position scenario, and from 3.5% to 12% in the cross-configuration scenario, compared to using the conventional FD model without domain adaptation training. The results demonstrate that the proposed DAFD successfully helps to deal with cross-domain problems and to achieve better detection performance.

Submitted 14 June, 2021; v1 submitted 20 December, 2020; originally announced December 2020.

Comments: Accepted by IEEE Transactions on Neural Systems and Rehabilitation Engineering, 10 pages, 8 figures, 5 tables

arXiv:2012.03426 [pdf]

Deep Learning Based Signal Enhancement of Low-Resolution Accelerometer for Fall Detection Systems

Authors: Kai-Chun Liu, Kuo-Hsuan Hung, Chia-Yeh Hsieh, Hsiang-Yun Huang, Chia-Tai Chan, Yu Tsao

Abstract: In the last two decades, fall detection (FD) systems have been developed as a popular assistive technology. Such systems automatically detect critical fall events and immediately alert medical professionals or caregivers. To support long-term FD services, various power-saving strategies have been implemented. Among them, a reduced sampling rate is a common approach for an energy-efficient system in the real world. However, the performance of FD systems is diminished owing to low-resolution (LR) accelerometer signals. To improve the detection accuracy with LR accelerometer signals, several technical challenges must be considered, including misalignment, mismatch of effective features, and the degradation effects. In this work, a deep-learning-based accelerometer signal enhancement (ASE) model is proposed to improve the detection performance of LR-FD systems. This proposed model reconstructs high-resolution (HR) signals from the LR signals by learning the relationship between the LR and HR signals. The results show that the FD system using support vector machine and the proposed ASE model at an extremely low sampling rate (sampling rate < 2 Hz) achieved 97.34% and 90.52% accuracies in the SisFall and FallAllD datasets, respectively, while those without ASE models only achieved 95.92% and 87.47% accuracies in the SisFall and FallAllD datasets, respectively. This study demonstrates that the ASE model helps the FD systems tackle the technical challenges of LR signals and achieve better detection performance.

Submitted 27 September, 2021; v1 submitted 6 December, 2020; originally announced December 2020.

Comments: Accepted by IEEE Transactions on Cognitive and Developmental Systems, 12 pages, 7 figures, 8 tables

arXiv:2011.00790 [pdf, ps, other]

doi 10.1109/ACCESS.2021.3136191

On Control of Epidemics with Application to COVID-19

Authors: Chung-Han Hsieh

Abstract: At the time of writing, the ongoing COVID-19 pandemic, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), had already resulted in more than thirty-two million cases infected and more than one million deaths worldwide. Given the fact that the pandemic is still threatening health and safety, it is urgent to understand the COVID-19 contagion process and know how it might be controlled. With this motivation in mind, in this paper, we consider a version of a stochastic discrete-time Susceptible-Infected-Recovered-Death (SIRD)-based epidemiological model with two uncertainties: the uncertain rate of infected cases which are undetected or asymptomatic, and the uncertain effectiveness rate of control. Our aim is to study the effect of an epidemic control policy on the uncertain model in a control-theoretic framework. We begin by providing the closed-form solutions of states in the modified SIRD-based model such as infected cases, susceptible cases, recovered cases, and deceased cases. Then, the corresponding expected states and the technical lower and upper bounds for those states are provided as well. Subsequently, we consider two epidemic control problems to be addressed: one is the almost sure epidemic control problem and the other is the average epidemic control problem.
Having defined the two problems, our main results are a set of sufficient conditions on a class of linear control policy which assures that the epidemic is "well-controlled"; i.e., both the infected cases and the deceased cases are upper bounded uniformly and the number of infected cases converges to zero asymptotically. Our numerical studies, using the historical COVID-19 contagion data in the United States, suggest that our appealingly simple model and control framework can provide a reasonable epidemic control performance compared to the ongoing pandemic situation.

Submitted 2 November, 2020; originally announced November 2020.

Comments: Submitted to the SIAM Journal on Control and Optimization

MSC Class: 93E03; 93D15; 92B05; 92D30

Journal ref: IEEE Access, vol. 9, pp. 167948-167958, 2021

arXiv:2006.05174 [pdf, other]

Input-independent Attention Weights Are Expressive Enough: A Study of Attention in Self-supervised Audio Transformers

Authors: Tsung-Han Wu, Chun-Chen Hsieh, Yen-Hao Chen, Po-Han Chi, Hung-yi Lee

Abstract: In this paper, we seek solutions for reducing the computation complexity of transformer-based models for speech representation learning. We evaluate 10 attention algorithms; then, we pre-train the transformer-based model with those attention algorithms in a self-supervised fashion and treat them as feature extractors on downstream tasks, including phoneme classification and speaker classification. With the assistance of t-SNE, PCA and some observation, the attention weights in self-supervised audio transformers can be categorized into four general cases. Based on these cases and some analyses, we are able to use a specific set of attention weights to initialize the model. Our approach shows comparable performance to the typical self-attention yet requires 20% less time in both training and inference.

Submitted 3 November, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

arXiv:2005.08575 [pdf, other]

Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation

Authors: Po-Han Chi, Pei-Hung Chung, Tsung-Han Wu, Chun-Cheng Hsieh, Yen-Hao Chen, Shang-Wen Li, Hung-yi Lee

Abstract: For self-supervised speech processing, it is crucial to use pretrained models as speech representation extractors. In recent works, increasing the size of the model has been utilized in acoustic model training in order to achieve better performance. In this paper, we propose Audio ALBERT, a lite version of the self-supervised speech representation model. We use the representations with two downstream tasks, speaker identification, and phoneme classification. We show that Audio ALBERT is capable of achieving competitive performance with those huge models in the downstream tasks while utilizing 91% fewer parameters. Moreover, we use some simple probing models to measure how much the information of the speaker and phoneme is encoded in latent representations. In probing experiments, we find that the latent representations encode richer information of both phoneme and speaker than that of the last layer.

Submitted 3 May, 2021; v1 submitted 18 May, 2020; originally announced May 2020.

Comments: Accepted by IEEE Spoken Language Technology Workshop 2021

arXiv:2004.12848 [pdf, ps, other]

doi 10.1016/j.automatica.2021.110051

Generalization of Affine Feedback Stock Trading Results to Include Stop-Loss Orders

Authors: Chung-Han Hsieh

Abstract: The takeoff point of this paper is to generalize the existing stock trading results for a class of affine feedback controller to include consideration of a stop-loss order. Using the geometric Brownian motion as the underlying stock price model, our main result is to provide a closed-form expression for the cumulative distribution function for the trading profit or loss. In addition, we show that the affine feedback controller with stop-loss order indeed generalizes the result without stop order in the sense of distribution function. Some simulations and illustrative examples are also provided as supporting evidence of the theory. Moreover, we provide some technical results aimed at addressing the issues about survivability, cash-financing considerations, long-only property, and lower bound of the expected gain or loss.

Submitted 27 April, 2020; originally announced April 2020.

Comments: SIAM Journal on Control and Optimization (SICON)

MSC Class: 93E03; 91B02; 91B70

Journal ref: Automatica, vol. 136, pp. 110051:1-110051:7, 2022

arXiv:2004.12099 [pdf, ps, other]

doi 10.1109/LCSYS.2020.3002214

Necessary and Sufficient Conditions for Frequency-Based Kelly Optimal Portfolio

Authors: Chung-Han Hsieh

Abstract: In this paper, we consider a discrete-time portfolio optimization problem with $m \geq 2$ assets which includes the rebalancing frequency as an additional parameter in the maximization. The so-called Kelly Criterion is used as the performance metric; i.e., maximizing the expected logarithmic growth of a trader's account, and the portfolio obtained is called the frequency-based Kelly optimal portfolio. The focal point of this paper is to extend the results of our previous work to obtain various optimality characterizations on the portfolio. To be more specific, using Kelly's criterion in our frequency-based formulation, we first prove necessary and sufficient conditions for the frequency-based Kelly optimal portfolio. With the aid of these conditions, we then show several new optimality characterizations such as expected ratio optimality and asymptotic relative optimality, and a result which we call the Extended Dominant Asset Theorem. That is, we prove that the $i$th asset is dominant in the portfolio if and only if the Kelly optimal portfolio consists of that asset only. The word "extended" in the theorem's name comes from the fact that only a sufficiency result was proved in our previous work. Hence, in this paper, we improve it to involve a proof of the necessity part. In addition, the trader's survivability issue (no bankruptcy consideration) is also studied in detail in our frequency-based trading framework.
Finally, to bridge the theory and practice, we propose a simple trading algorithm using the notion called dominant asset condition to decide when one should trigger a trade. The corresponding trading performance using historical price data is reported as supporting evidence.

Submitted 25 April, 2020; originally announced April 2020.

Comments: Submitted to IEEE Control Systems Letters

Journal ref: IEEE Control Systems Letters, vol. 5, no. 1, pp. 349-354, 2021

arXiv:1901.02480 [pdf, ps, other]

doi 10.1109/TAC.2019.2945885

On Positive Solutions of a Delay Equation Arising When Trading in Financial Markets

Authors: Chung-Han Hsieh, B. Ross Barmish, John A. Gubner

Abstract: We consider a discrete-time, linear state equation with delay which arises as a model for a trader's account value when buying and selling a risky asset in a financial market. The state equation includes a nonnegative feedback gain $α$ and a sequence $v(k)$ that models asset returns which are within known bounds but otherwise arbitrary. We introduce two thresholds, $α_-$ and $α_+$, depending on these bounds, and prove that for $α < α_-$, state positivity is guaranteed for all time and all asset-return sequences; i.e., bankruptcy is ruled out and positive solutions of the state equation are continuable indefinitely. On the other hand, for $α > α_+$, we show that there is always a sequence of asset returns for which the state fails to be positive for all time; i.e., along this sequence, bankruptcy is certain and the solution of the state equation ceases to be meaningful after some finite time. Finally, this paper also includes a conjecture which says that for the "gap" interval $α_- \leq α \leq α_+$, state positivity is also guaranteed for all time. Support for the conjecture, both theoretical and computational, is provided.

Submitted 16 October, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

Comments: Accepted to IEEE Transactions on Automatic Control

MSC Class: 93EXX

Journal ref: IEEE Transactions on Automatic Control, AC-65, no. 7, pp. 3143-3149, 2020

Showing 1–34 of 34 results for author: Hsieh, C