Showing 1–33 of 33 results for author: Yi, Y

Searching in archive eess.
  1. arXiv:2507.04660  [pdf, ps, other]

    eess.IV cs.CV

    CP-Dilatation: A Copy-and-Paste Augmentation Method for Preserving the Boundary Context Information of Histopathology Images

    Authors: Sungrae Hong, Sol Lee, Mun Yong Yi

    Abstract: Medical AI diagnosis including histopathology segmentation has derived benefits from the recent development of deep learning technology. However, deep learning itself requires a large amount of training data and the medical image segmentation masking, in particular, requires an extremely high cost due to the shortage of medical specialists. To mitigate this issue, we propose a new data augmentatio…

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 5 pages, 5 figures

  2. arXiv:2506.12463  [pdf, ps, other]

    eess.SY physics.soc-ph

    Adding links wisely: how an influencer seeks for leadership in opinion dynamics?

    Authors: Lingfei Wang, Yu Xing, Yuhao Yi, Ming Cao, Karl H. Johansson

    Abstract: This paper investigates the problem of leadership development for an external influencer using the Friedkin-Johnsen (FJ) opinion dynamics model, where the influencer is modeled as a fully stubborn agent and leadership is quantified by social power. The influencer seeks to maximize her social power by strategically adding a limited number of links to regular agents. This optimization problem is sho…

    Submitted 14 June, 2025; originally announced June 2025.

  3. arXiv:2412.16567  [pdf]

    eess.IV

    Federal Learning Framework for Quality Evaluation of Blastomere Cleavage

    Authors: Jung-Hua Wang, Huai-Wen Chang, Rong-Yu Wu, Ting-Yuan Wang, Ming-Jer Chen, Yu-Chiao Yi

    Abstract: This study addresses the issue of leveraging federated learning to improve data privacy and performance in IVF embryo selection. The EM (Expectation-Maximization) algorithm is incorporated into deep learning models to form a federated learning framework for quality evaluation of blastomere cleavage using two-dimensional images. The framework comprises a server site and several client sites charact…

    Submitted 21 December, 2024; originally announced December 2024.

    Comments: 6 pages, 8 figures. Accepted by the 1st Workshop on Federated Learning for Unbounded and Intelligent Decentralization (FLUID), in conjunction with AAAI 2025 in Philadelphia, Pennsylvania, USA

  4. arXiv:2411.08307  [pdf, other]

    cs.AI cs.MM cs.SD eess.AS

    PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation

    Authors: Yungang Yi, Weihua Li, Matthew Kuo, Quan Bai

    Abstract: AI-based music generation has progressed significantly in recent years. However, creating symbolic music that is both long-structured and expressive remains a considerable challenge. In this paper, we propose PerceiverS (Segmentation and Scale), a novel architecture designed to address this issue by leveraging both Effective Segmentation and Multi-Scale attention mechanisms. Our approach enhances…

    Submitted 4 December, 2024; v1 submitted 12 November, 2024; originally announced November 2024.

  5. arXiv:2408.06645  [pdf]

    eess.SY

    Dynamic Pricing of Electric Vehicle Charging Station Alliances Under Information Asymmetry

    Authors: Zeyu Liu, Yun Zhou, Donghan Feng, Shaolun Xu, Yin Yi, Hengjie Li, Haojing Wang

    Abstract: Due to the centralization of charging stations (CSs), CSs are organized as charging station alliances (CSAs) in the commercial competition. Under this situation, this paper studies the profit-oriented dynamic pricing strategy of CSAs. As the practicability basis, a privacy-protected bidirectional real-time information interaction framework is designed, under which the status of EVs is utilized as…

    Submitted 13 August, 2024; originally announced August 2024.

  6. arXiv:2406.02430  [pdf, other]

    eess.AS cs.SD

    Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

    Authors: Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, Mingqing Gong, Peisong Huang, Qingqing Huang, Zhiying Huang, Yuanyuan Huo, Dongya Jia, Chumin Li, Feiya Li, Hui Li, Jiaxin Li, Xiaoyang Li, Xingxing Li, Lin Liu, Shouda Liu, Sichao Liu, et al. (21 additional authors not shown)

    Abstract: We introduce Seed-TTS, a family of large-scale autoregressive text-to-speech (TTS) models capable of generating speech that is virtually indistinguishable from human speech. Seed-TTS serves as a foundation model for speech generation and excels in speech in-context learning, achieving performance in speaker similarity and naturalness that matches ground truth human speech in both objective and sub…

    Submitted 4 June, 2024; originally announced June 2024.

  7. arXiv:2405.18251  [pdf, other]

    cs.RO eess.SY math.OC

    Sensor-Based Distributionally Robust Control for Safe Robot Navigation in Dynamic Environments

    Authors: Kehan Long, Yinzhuang Yi, Zhirui Dai, Sylvia Herbert, Jorge Cortés, Nikolay Atanasov

    Abstract: We introduce a novel method for mobile robot navigation in dynamic, unknown environments, leveraging onboard sensing and distributionally robust optimization to impose probabilistic safety constraints. Our method introduces a distributionally robust control barrier function (DR-CBF) that directly integrates noisy sensor measurements and state estimates to define safety constraints. This approach i…

    Submitted 5 May, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Project page: https://existentialrobotics.org/DRO_Safe_Navigation

  8. arXiv:2310.01363  [pdf, ps, other]

    cs.RO eess.SY

    EAST: Environment Aware Safe Tracking using Planning and Control Co-Design

    Authors: Zhichao Li, Yinzhuang Yi, Zhuolin Niu, Nikolay Atanasov

    Abstract: This paper considers the problem of autonomous mobile robot navigation in unknown environments with moving obstacles. We propose a new method to achieve environment-aware safe tracking (EAST) of robot motion plans that integrates an obstacle clearance cost for path planning, a convex reachable set for robot motion prediction, and safety constraints for dynamic obstacle avoidance. EAST adapts the m…

    Submitted 12 June, 2025; v1 submitted 2 October, 2023; originally announced October 2023.

  9. arXiv:2309.01072  [pdf, other]

    eess.IV cs.CV

    Channel Attention Separable Convolution Network for Skin Lesion Segmentation

    Authors: Changlu Guo, Jiangyan Dai, Marton Szemenyei, Yugen Yi

    Abstract: Skin cancer is a frequently occurring cancer in the human population, and it is very important to be able to diagnose malignant tumors in the body early. Lesion segmentation is crucial for monitoring the morphological changes of skin lesions, extracting features to localize and identify diseases to assist doctors in early diagnosis. Manual de-segmentation of dermoscopic images is error-prone and t…

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: Accepted by ICONIP 2023

  10. arXiv:2207.12110  [pdf, ps, other]

    eess.SY cs.SI

    A Sample-Based Algorithm for Approximately Testing $r$-Robustness of a Digraph

    Authors: Yuhao Yi, Yuan Wang, Xingkang He, Stacy Patterson, Karl H. Johansson

    Abstract: One of the intensely studied concepts of network robustness is $r$-robustness, which is a network topology property quantified by an integer $r$. It is required by mean subsequence reduced (MSR) algorithms and their variants to achieve resilient consensus. However, determining $r$-robustness is intractable for large networks. In this paper, we propose a sample-based algorithm to approximately test…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: 8 pages, 3 figures

  11. arXiv:2207.09818  [pdf, other]

    eess.SY cs.LG

    Operating Envelopes under Probabilistic Electricity Demand and Solar Generation Forecasts

    Authors: Yu Yi, Gregor Verbic

    Abstract: The increasing penetration of distributed energy resources in low-voltage networks is turning end-users from consumers to prosumers. However, the incomplete smart meter rollout and paucity of smart meter data due to the regulatory separation between retail and network service provision make active distribution network management difficult. Furthermore, distribution network operators oftentimes do…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: In proceedings of the 11th Bulk Power Systems Dynamics and Control Symposium (IREP 2022), July 25-30, 2022, Banff, Canada

    Report number: IREP2022-79

  12. arXiv:2205.04421  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality

    Authors: Xu Tan, Jiawei Chen, Haohe Liu, Jian Cong, Chen Zhang, Yanqing Liu, Xi Wang, Yichong Leng, Yuanhao Yi, Lei He, Frank Soong, Tao Qin, Sheng Zhao, Tie-Yan Liu

    Abstract: Text to speech (TTS) has made rapid progress in both academia and industry in recent years. Some questions naturally arise that whether a TTS system can achieve human-level quality, how to define/judge that quality and how to achieve it. In this paper, we answer these questions by first defining the human-level quality based on the statistical significance of subjective measure and introducing app…

    Submitted 10 May, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 19 pages, 3 figures, 8 tables

  13. arXiv:2204.12105  [pdf, other]

    cs.CV eess.IV

    Learning Dual-Pixel Alignment for Defocus Deblurring

    Authors: Yu Li, Yaling Yi, Dongwei Ren, Qince Li, Wangmeng Zuo

    Abstract: It is a challenging task to recover sharp image from a single defocus blurry image in real-world applications. On many modern cameras, dual-pixel (DP) sensors create two-image views, based on which stereo information can be exploited to benefit defocus deblurring. Despite the impressive results achieved by existing DP defocus deblurring methods, the misalignment between DP image views is still not…

    Submitted 19 February, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: Project page: https://github.com/liyucs/DPANet

  14. arXiv:2107.09404  [pdf, ps, other]

    cs.IT eess.SP

    Maximizing the Set Cardinality of Users Scheduled for Ultra-dense uRLLC Networks

    Authors: Shiwen He, Jun Yuan, Zhenyu An, Yunshan Yi, Yongming Huang

    Abstract: Ultra-reliability and low latency communication has long been an important but challenging task in the fifth and sixth generation wireless communication systems. Scheduling as many users as possible to serve on the limited time-frequency resource is one of a crucial topic, subjecting to the maximum allowable transmission power and the minimum rate requirement of each user. We address it by proposi…

    Submitted 9 September, 2021; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: 4 pages, 3 figures

  15. arXiv:2102.03688  [pdf, other]

    eess.SP cs.AI

    Making Intelligent Reflecting Surfaces More Intelligent: A Roadmap Through Reservoir Computing

    Authors: Zhou Zhou, Kangjun Bai, Nima Mohammadi, Yang Yi, Lingjia Liu

    Abstract: This article introduces a neural network-based signal processing framework for intelligent reflecting surface (IRS) aided wireless communications systems. By modeling radio-frequency (RF) impairments inside the "meta-atoms" of IRS (including nonlinearity and memory effects), we present an approach that generalizes the entire IRS-aided system as a reservoir computing (RC) system, an efficient recur…

    Submitted 6 February, 2021; originally announced February 2021.

  16. arXiv:2102.02998  [pdf, other]

    eess.AS

    Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output

    Authors: Hangting Chen, Yang Yi, Dang Feng, Pengyuan Zhang

    Abstract: Time-domain audio separation network (TasNet) has achieved remarkable performance in blind source separation (BSS). Classic multi-channel speech processing framework employs signal estimation and beamforming. For example, Beam-TasNet links multi-channel convolutional TasNet (MC-Conv-TasNet) with minimum variance distortionless response (MVDR) beamforming, which leverages the strong modeling abilit…

    Submitted 12 April, 2022; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: Submitted to Interspeech 2022

  17. arXiv:2009.08829  [pdf, other]

    eess.IV cs.CV

    Residual Spatial Attention Network for Retinal Vessel Segmentation

    Authors: Changlu Guo, Márton Szemenyei, Yugen Yi, Wei Zhou, Haodong Bian

    Abstract: Reliable segmentation of retinal vessels can be employed as a way of monitoring and diagnosing certain diseases, such as diabetes and hypertension, as they affect the retinal vascular structure. In this work, we propose the Residual Spatial Attention Network (RSAN) for retinal vessel segmentation. RSAN employs a modified residual block structure that integrates DropBlock, which can not only be uti…

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: ICONIP 2020

  18. arXiv:2005.04097  [pdf, other]

    cs.DC cs.LG eess.SP

    Delay-aware Resource Allocation in Fog-assisted IoT Networks Through Reinforcement Learning

    Authors: Qiang Fan, Jianan Bai, Hongxia Zhang, Yang Yi, Lingjia Liu

    Abstract: Fog nodes in the vicinity of IoT devices are promising to provision low latency services by offloading tasks from IoT devices to them. Mobile IoT is composed by mobile IoT devices such as vehicles, wearable devices and smartphones. Owing to the time-varying channel conditions, traffic loads and computing loads, it is challenging to improve the quality of service (QoS) of mobile IoT devices. As tas…

    Submitted 10 July, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

  19. arXiv:2004.03702  [pdf]

    eess.IV cs.CV

    Channel Attention Residual U-Net for Retinal Vessel Segmentation

    Authors: Changlu Guo, Márton Szemenyei, Yangtao Hu, Wenle Wang, Wei Zhou, Yugen Yi

    Abstract: Retinal vessel segmentation is a vital step for the diagnosis of many early eye-related diseases. In this work, we propose a new deep learning model, namely Channel Attention Residual U-Net (CAR-UNet), to accurately segment retinal vascular and non-vascular pixels. In this model, we introduced a novel Modified Efficient Channel Attention (MECA) to enhance the discriminative ability of the network…

    Submitted 20 October, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

  20. arXiv:2004.03697  [pdf]

    eess.IV cs.CV

    Dense Residual Network for Retinal Vessel Segmentation

    Authors: Changlu Guo, Márton Szemenyei, Yugen Yi, Ying Xue, Wei Zhou, Yangyuan Li

    Abstract: Retinal vessel segmentation plays an important role in the field of retinal image analysis because changes in retinal vascular structure can aid in the diagnosis of diseases such as hypertension and diabetes. In recent research, numerous successful segmentation methods for fundus images have been proposed. But for other retinal imaging modalities, more research is needed to explore vascular extra…

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted by IEEE ICASSP 2020

  21. arXiv:2004.03696  [pdf]

    eess.IV cs.CV

    SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation

    Authors: Changlu Guo, Márton Szemenyei, Yugen Yi, Wenle Wang, Buer Chen, Changqi Fan

    Abstract: The precise segmentation of retinal blood vessels is of great significance for early diagnosis of eye-related diseases such as diabetes and hypertension. In this work, we propose a lightweight network named Spatial Attention U-Net (SA-UNet) that does not require thousands of annotated training samples and can be utilized in a data augmentation manner to use the available annotated samples more eff…

    Submitted 20 October, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: ICPR 2020

  22. arXiv:2003.06923  [pdf, other]

    eess.SP cs.LG

    RCNet: Incorporating Structural Information into Deep RNN for MIMO-OFDM Symbol Detection with Limited Training

    Authors: Zhou Zhou, Lingjia Liu, Shashank Jere, Jianzhong Zhang, Yang Yi

    Abstract: In this paper, we investigate learning-based MIMO-OFDM symbol detection strategies focusing on a special recurrent neural network (RNN) -- reservoir computing (RC). We first introduce the Time-Frequency RC to take advantage of the structural information inherent in OFDM signals. Using the time domain RC and the time-frequency RC as the building blocks, we provide two extensions of the shallow RC t…

    Submitted 15 March, 2020; originally announced March 2020.

  23. arXiv:2002.06109  [pdf, other]

    cs.SI eess.SY

    Diffusion and Consensus in a Weakly Coupled Network of Networks

    Authors: Yuhao Yi, Anirban Das, Stacy Patterson, Bassam Bamieh, Zhongzhi Zhang

    Abstract: We study diffusion and consensus dynamics in a Network of Networks model. In this model, there is a collection of sub-networks, connected to one another using a small number of links. We consider a setting where the links between networks have small weights, or are used less frequently than links within each sub-network. Using spectral perturbation theory, we analyze the diffusion rate and converg…

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: 12 pages, 5 figures

  24. arXiv:2002.00275  [pdf, other]

    eess.SY

    Data-Driven Stochastic Optimization for Power Grids Scheduling under High Wind Penetration

    Authors: Wei Xie, Yuan Yi, Zhi Zhou, Keqi Wang

    Abstract: To address the environmental concern and improve the economic efficiency, the wind power is rapidly integrated into smart grids. However, the inherent uncertainty of wind energy raises operational challenges. To ensure the cost-efficient, reliable and robust operation, it is critically important to find the optimal decision that can correctly and rigorously hedge against all sources of uncertainty…

    Submitted 9 November, 2020; v1 submitted 1 February, 2020; originally announced February 2020.

    Comments: 24 pages, 2 figures

  25. arXiv:1911.11338  [pdf, ps, other]

    cs.SI eess.SY math.OC

    Disagreement and Polarization in Two-Party Social Networks

    Authors: Yuhao Yi, Stacy Patterson

    Abstract: We investigate disagreement and polarization in a social network with two polarizing sources of information. First, we define disagreement and polarization indices in two-party leader-follower models of opinion dynamics. We then give expressions for the indices in terms of a graph Laplacian. The expressions show a relationship between these quantities and the concepts of resistance distance and bi…

    Submitted 15 May, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

    Comments: 8 pages, 1 figure

  26. arXiv:1910.13009  [pdf, ps, other]

    cs.SI eess.SY math.OC

    Shifting Opinions in a Social Network Through Leader Selection

    Authors: Yuhao Yi, Timothy Castiglia, Stacy Patterson

    Abstract: We study the French-DeGroot opinion dynamics in a social network with two polarizing parties. We consider a network in which the leaders of one party are given, and we pose the problem of selecting the leader set of the opposing party so as to shift the average opinion to a desired value. When each party has only one leader, we express the average opinion in terms of the transition matrix and the…

    Submitted 15 May, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: 14 pages, 4 figures

  27. arXiv:1909.11707  [pdf, other]

    cs.CR eess.SP

    Implementation of three LWC Schemes in the WiFi 4-Way Handshake with Software Defined Radio

    Authors: Yunjie Yi, Guang Gong, Kalikinkar Mandal

    Abstract: With the rapid deployment of Internet of Things (IoT) devices in applications such as smarthomes, healthcare and industrial automation, security and privacy has become a major concern. Recently, National Institute of Standards and Technology (NIST) has initiated a lightweight cryptography (LWC) competition to standardize new cryptographic algorithm(s) for providing security in resource-constrained…

    Submitted 30 September, 2021; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: NIST Lightweight Cryptography Workshop 2019

  28. arXiv:1906.10681  [pdf]

    eess.IV physics.optics

    Metalens With Artificial Focus Pattern

    Authors: Mao Ye, Vishva Ray, Dachuan Wu, Yasha Yi

    Abstract: Metalens as one of the most popular applications of emerging optical metasurfaces has raised widespread interest recently. With nano structures fully controlling phase, polarization and transmission, metalens has achieved comparable performance of commercial objective lenses. While recent studies seeking for the accomplishment of traditional focusing behaviors through metalens are successful, inth…

    Submitted 25 June, 2019; originally announced June 2019.

    Comments: 9 pages, 6 figures

  29. arXiv:1906.08977  [pdf, other]

    cs.SD cs.LG eess.AS

    Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling

    Authors: Yuan-Hao Yi, Yang Ai, Zhen-Hua Ling, Li-Rong Dai

    Abstract: This paper presents a method of using autoregressive neural networks for the acoustic modeling of singing voice synthesis (SVS). Singing voice differs from speech and it contains more local dynamic movements of acoustic features, e.g., vibratos. Therefore, our method adopts deep autoregressive (DAR) models to predict the F0 and spectral features of singing voice in order to better describe the dep…

    Submitted 21 June, 2019; originally announced June 2019.

    Comments: Interspeech 2019

  30. arXiv:1809.04972  [pdf, other]

    eess.SY cs.GT math.OC

    Simulation-based Distributed Coordination Maximization over Networks

    Authors: Hyeryung Jang, Jinwoo Shin, Yung Yi

    Abstract: In various online/offline multi-agent networked environments, it is very popular that the system can benefit from coordinating actions of two interacting agents at some cost of coordination. In this paper, we first formulate an optimization problem that captures the amount of coordination gain at the cost of node activation over networks. This problem is challenging to solve in a distributed manne…

    Submitted 13 September, 2018; originally announced September 2018.

    Comments: 34 pages, 4 figures. A shorter version of this paper appeared in Proceedings of ACM Mobile Ad Hoc Networking and Computing (MOBIHOC), 2016. To appear at IEEE Transactions on Control of Network Systems, 2018

  31. arXiv:1801.00535  [pdf, ps, other]

    eess.SY cs.SI

    Scale-free Loopy Structure is Resistant to Noise in Consensus Dynamics in Complex Networks

    Authors: Yuhao Yi, Zhongzhi Zhang, Stacy Patterson

    Abstract: The vast majority of real-world networks are scale-free, loopy, and sparse, with a power-law degree distribution and a constant average degree. In this paper, we study first-order consensus dynamics in binary scale-free networks, where vertices are subject to white noise. We focus on the coherence of networks characterized in terms of the $H_2$-norm, which quantifies how closely agents track the c…

    Submitted 1 January, 2018; originally announced January 2018.

    Comments: 10 pages, 4 figures

  32. arXiv:1712.06496  [pdf, ps, other]

    eess.SY cs.SI

    Consensus in Self-similar Hierarchical Graphs and Sierpiński Graphs: Convergence Speed, Delay Robustness, and Coherence

    Authors: Yi Qi, Zhongzhi Zhang, Yuhao Yi, Huan Li

    Abstract: The hierarchical graphs and Sierpiński graphs are constructed iteratively, which have the same number of vertices and edges at any iteration, but exhibit quite different structural properties: the hierarchical graphs are non-fractal and small-world, while the Sierpiński graphs are fractal and "large-world". Both graphs have found broad applications. In this paper, we study consensus problems in hi…

    Submitted 18 December, 2017; originally announced December 2017.

    Comments: To be published on IEEE Transactions on Cybernetics

  33. arXiv:1708.06873  [pdf, other]

    math.OC eess.SY

    A Resistance Distance-Based Approach for Optimal Leader Selection in Noisy Consensus Networks

    Authors: Stacy Patterson, Yuhao Yi, Zhongzhi Zhang

    Abstract: We study the performance of leader-follower noisy consensus networks, and in particular, the relationship between this performance and the locations of the leader nodes. Two types of dynamics are considered (1) noise-free leaders, in which leaders dictate the trajectory exactly and followers are subject to external disturbances, and (2) noise-corrupted leaders, in which both leaders and followers…

    Submitted 22 August, 2017; originally announced August 2017.