
Showing 1–23 of 23 results for author: Miao, J

Searching in archive eess.
  1. arXiv:2506.01591  [pdf, ps, other]

    cs.GR cs.CR cs.CV cs.SD eess.AS

    Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation

    Authors: Yuan Gan, Jiaxu Miao, Yunze Wang, Yi Yang

    Abstract: Advances in talking-head animation based on Latent Diffusion Models (LDM) enable the creation of highly realistic, synchronized videos. These fabricated videos are indistinguishable from real ones, increasing the risk of potential misuse for scams, political manipulation, and misinformation. Hence, addressing these ethical concerns has become a pressing issue in AI security. Recent proactive defen…

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: Accepted to CVPR 2025

  2. arXiv:2405.14251   

    cs.RO eess.SY

    Efficient Navigation of a Robotic Fish Swimming Across the Vortical Flow Field

    Authors: Haodong Feng, Dehan Yuan, Jiale Miao, Jie You, Yue Wang, Yi Zhu, Dixia Fan

    Abstract: Navigating efficiently across vortical flow fields presents a significant challenge in various robotic applications. The dynamic and unsteady nature of vortical flows often disturbs the control of underwater robots, complicating their operation in hydrodynamic environments. Conventional control methods, which depend on accurate modeling, fail in these settings due to the complexity of fluid-struct…

    Submitted 27 September, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: We would like to request the withdrawal of our submission due to some misunderstandings among the co-authors concerning the submission process. It appears that the current version was submitted before we reached a consensus among all authors. We are actively working to address these matters and plan to resubmit a revised version once we achieve agreement

  3. arXiv:2403.16286  [pdf, other]

    eess.IV cs.CV

    HemoSet: The First Blood Segmentation Dataset for Automation of Hemostasis Management

    Authors: Albert J. Miao, Shan Lin, Jingpei Lu, Florian Richter, Benjamin Ostrander, Emily K. Funk, Ryan K. Orosco, Michael C. Yip

    Abstract: Hemorrhaging occurs in surgeries of all types, forcing surgeons to quickly adapt to the visual interference that results from blood rapidly filling the surgical field. Introducing automation into the crucial surgical task of hemostasis management would offload mental and physical tasks from the surgeon and surgical assistants while simultaneously increasing the efficiency and safety of the operati…

    Submitted 2 June, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  4. arXiv:2308.06614  [pdf, other]

    eess.SY

    A Fog-based Smart Agriculture System to Detect Animal Intrusion

    Authors: Jinpeng Miao, Dasari Rajasekhar, Shivakant Mishra, Sanjeet Kumar Nayak, Ramanarayan Yadav

    Abstract: Smart agriculture is one of the most promising areas where IoT-enabled technologies have the potential to substantially improve the quality and quantity of the crops and reduce the associated operational cost. However, building a smart agriculture system presents several challenges, including high latency and bandwidth consumption associated with cloud computing, frequent Internet disconnections i…

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: 9 pages, 16 figures

  5. arXiv:2305.00416  [pdf, other]

    eess.IV

    Quaternion Matrix Completion Using Untrained Quaternion Convolutional Neural Network for Color Image Inpainting

    Authors: Jifei Miao, Kit Ian Kou, Liqiao Yang, Juan Han

    Abstract: The use of quaternions as a novel tool for color image representation has yielded impressive results in color image processing. By considering the color image as a unified entity rather than separate color space components, quaternions can effectively exploit the strong correlation among the RGB channels, leading to enhanced performance. Especially, color image inpainting tasks are highly benefici…

    Submitted 30 April, 2023; originally announced May 2023.

  6. arXiv:2212.08361  [pdf, other]

    eess.IV

    Quaternion Tensor Completion with Sparseness for Color Video Recovery

    Authors: Liqiao Yang, Kit Ian Kou, Jifei Miao, Yang Liu, Maggie Pui Man Hoi

    Abstract: A novel low-rank completion algorithm based on the quaternion tensor is proposed in this paper. This approach uses the TQt-rank of quaternion tensor to maintain the structure of RGB channels throughout the entire process. In more detail, the pixels in each frame are encoded on three imaginary parts of a quaternion as an element in a quaternion matrix. Each quaternion matrix is then stacked into a…

    Submitted 16 December, 2022; originally announced December 2022.

  7. arXiv:2211.12793  [pdf, other]

    eess.IV

    Low Rank Quaternion Matrix Completion Based on Quaternion QR Decomposition and Sparse Regularizer

    Authors: Juan Han, Liqiao Yang, Kit Ian Kou, Jifei Miao, Lizhi Liu

    Abstract: Matrix completion is one of the most challenging problems in computer vision. Recently, quaternion representations of color images have achieved competitive performance in many fields. Because it treats the color image as a whole, the coupling information between the three channels of the color image is better utilized. Due to this, low-rank quaternion matrix completion (LRQMC) algorithms have gai…

    Submitted 23 November, 2022; originally announced November 2022.

  8. arXiv:2210.16674  [pdf, other]

    eess.IV cs.CV

    Semantic-SuPer: A Semantic-aware Surgical Perception Framework for Endoscopic Tissue Identification, Reconstruction, and Tracking

    Authors: Shan Lin, Albert J. Miao, Jingpei Lu, Shunkai Yu, Zih-Yun Chiu, Florian Richter, Michael C. Yip

    Abstract: Accurate and robust tracking and reconstruction of the surgical scene is a critical enabling technology toward autonomous robotic surgery. Existing algorithms for 3D perception in surgery mainly rely on geometric information, while we propose to also leverage semantic information inferred from the endoscopic video using image segmentation algorithms. In this paper, we present a novel, comprehensiv…

    Submitted 20 February, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

    Comments: IEEE International Conference on Robotics and Automation (ICRA) 2023

  9. arXiv:2209.02964  [pdf, other]

    eess.IV

    Quaternion Tensor Train Rank Minimization with Sparse Regularization in a Transformed Domain for Quaternion Tensor Completion

    Authors: Jifei Miao, Kit Ian Kou, Liqiao Yang, Dong Cheng

    Abstract: The tensor train rank (TT-rank) has achieved promising results in tensor completion due to its ability to capture the global low-rankness of higher-order (>3) tensors. On the other hand, recently, quaternions have proven to be a very suitable framework for encoding color pixels, and have obtained outstanding performance in various color image processing tasks. In this paper, the quaternion tensor…

    Submitted 7 September, 2022; originally announced September 2022.

  10. arXiv:2207.01287  [pdf, other]

    eess.IV cs.CV

    FFCNet: Fourier Transform-Based Frequency Learning and Complex Convolutional Network for Colon Disease Classification

    Authors: Kai-Ni Wang, Yuting He, Shuaishuai Zhuang, Juzheng Miao, Xiaopu He, Ping Zhou, Guanyu Yang, Guang-Quan Zhou, Shuo Li

    Abstract: Reliable automatic classification of colonoscopy images is of great significance in assessing the stage of colonic lesions and formulating appropriate treatment plans. However, it is challenging due to uneven brightness, location variability, inter-class similarity, and intra-class dissimilarity, affecting the classification accuracy. To address the above issues, we propose a Fourier-based Frequen…

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted for publication at the 25th International Conference on Medical Image Computing and Computer Assisted Intervention - MICCAI 2022

  11. arXiv:2201.05344  [pdf, other]

    eess.IV cs.CV cs.LG

    AWSnet: An Auto-weighted Supervision Attention Network for Myocardial Scar and Edema Segmentation in Multi-sequence Cardiac Magnetic Resonance Images

    Authors: Kai-Ni Wang, Xin Yang, Juzheng Miao, Lei Li, Jing Yao, Ping Zhou, Wufeng Xue, Guang-Quan Zhou, Xiahai Zhuang, Dong Ni

    Abstract: Multi-sequence cardiac magnetic resonance (CMR) provides essential pathology information (scar and edema) to diagnose myocardial infarction. However, automatic pathology segmentation can be challenging due to the difficulty of effectively exploring the underlying information from the multi-sequence CMR data. This paper aims to tackle the scar and edema segmentation from multi-sequence CMR with a n…

    Submitted 14 January, 2022; originally announced January 2022.

    Comments: 19 pages, 10 figures, accepted by Medical Image Analysis

  12. arXiv:2112.13982  [pdf, other]

    cs.CV eess.IV

    Quaternion-based dynamic mode decomposition for background modeling in color videos

    Authors: Juan Han, Kit Ian Kou, Jifei Miao

    Abstract: Scene Background Initialization (SBI) is one of the challenging problems in computer vision. Dynamic mode decomposition (DMD) is a recently proposed method to robustly decompose a video sequence into the background model and the corresponding foreground part. However, this method needs to convert the color image into the grayscale image for processing, which leads to the neglect of the coupling in…

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: 16 pages

  13. arXiv:2109.14797  [pdf, other]

    cs.SD cs.AI cs.RO eess.AS

    Emergency Vehicles Audio Detection and Localization in Autonomous Driving

    Authors: Hongyi Sun, Xinyi Liu, Kecheng Xu, Jinghao Miao, Qi Luo

    Abstract: Emergency vehicles in service have right-of-way over all other vehicles. Hence, all other vehicles are supposed to take proper actions to yield emergency vehicles with active sirens. As this task requires the cooperation between ears and eyes for human drivers, it also needs audio detection as a supplement to vision-based algorithms for fully autonomous driving vehicles. In urban driving scenarios…

    Submitted 1 October, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

  14. arXiv:2107.01380  [pdf, other]

    eess.IV

    Low Rank Quaternion Matrix Recovery via Logarithmic Approximation

    Authors: Liqiao Yang, Jifei Miao, Kit Ian Kou

    Abstract: In color image processing, image completion aims to restore missing entries from the incomplete observation image. Recently, great progress has been made in achieving completion by approximately solving the rank minimization problem. In this paper, we utilize a novel quaternion matrix logarithmic norm to approximate rank under the quaternion matrix framework. From one side, unlike the traditional…

    Submitted 3 July, 2021; originally announced July 2021.

    Comments: 35 pages, 7 figures

  15. arXiv:2101.02443  [pdf, other]

    eess.IV

    Weighted Truncated Nuclear Norm Regularization for Low-Rank Quaternion Matrix Completion

    Authors: Liqiao Yang, Kit Ian Kou, Jifei Miao

    Abstract: In recent years, quaternion matrix completion (QMC) based on low-rank regularization has been gradually used in image de-noising and de-blurring. Unlike low-rank matrix completion (LRMC) which handles RGB images by recovering each color channel separately, the QMC models utilize the connection of three channels by processing them as a whole. Most of the existing quaternion-based methods formulate l…

    Submitted 7 January, 2021; originally announced January 2021.

  16. arXiv:2101.00364  [pdf, other]

    eess.IV cs.CV math.NA

    Quaternion higher-order singular value decomposition and its applications in color image processing

    Authors: Jifei Miao, Kit Ian Kou

    Abstract: Higher-order singular value decomposition (HOSVD) is one of the most efficient tensor decomposition techniques. It has the salient ability to represent high-dimensional data and extract features. In more recent years, the quaternion has proven to be a very suitable tool for color pixel representation as it can well preserve cross-channel correlation of color channels. Motivated by the advantages o…

    Submitted 1 January, 2021; originally announced January 2021.

  17. arXiv:2011.04250  [pdf]

    cs.RO cs.LG eess.SY

    A Learning-Based Tune-Free Control Framework for Large Scale Autonomous Driving System Deployment

    Authors: Yu Wang, Shu Jiang, Weiman Lin, Yu Cao, Longtao Lin, Jiangtao Hu, Jinghao Miao, Qi Luo

    Abstract: This paper presents the design of a tune-free (human-out-of-the-loop parameter tuning) control framework, aiming at accelerating large scale autonomous driving system deployed on various vehicles and driving environments. The framework consists of three machine-learning-based procedures, which jointly automate the control parameter tuning for autonomous driving, including: a learning-based dynamic…

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: 8 pages, 12 figures

  18. arXiv:2010.09776  [pdf, other]

    cs.MA cs.AI cs.GT cs.LG eess.SY

    SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

    Authors: Ming Zhou, Jun Luo, Julian Villella, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Aurora Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Dong Chen, Zhengbang Zhu, Nhat Nguyen, Mohamed Elsayed, Kun Shao, Sanjeevan Ahilan, Baokuan Zhang, Jiannan Wu, Zhengang Fu, Kasra Rezaee, Peyman Yadmellat, et al. (12 additional authors not shown)

    Abstract: Multi-agent interaction is a fundamental aspect of autonomous driving in the real world. Despite more than a decade of research and development, the problem of how to competently interact with diverse road users in diverse scenarios remains largely unsolved. Learning methods have much to offer towards solving this problem. But they require a realistic multi-agent simulator that generates diverse a…

    Submitted 31 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 20 pages, 11 figures. Paper accepted to CoRL 2020

  19. arXiv:2006.00749  [pdf, other]

    eess.IV math.NA

    Constrained low-rank quaternion approximation for color image denoising by bilateral random projections

    Authors: Jifei Miao, Kit Ian Kou

    Abstract: In this letter, we propose a novel low-rank quaternion approximation (LRQA) model by directly constraining the quaternion rank prior for effectively removing the noise in color images. The LRQA model treats the color image holistically rather than independently for the color space components, thus it can fully utilize the high correlation among RGB channels. We design an iterative algorithm by usi…

    Submitted 1 June, 2020; originally announced June 2020.

  20. Quaternion-based bilinear factor matrix norm minimization for color image inpainting

    Authors: Jifei Miao, Kit Ian Kou

    Abstract: As a new color image representation tool, quaternion has achieved excellent results in the color image processing, because it treats the color image as a whole rather than as a separate color space component, thus it can make full use of the high correlation among RGB channels. Recently, low-rank quaternion matrix completion (LRQMC) methods have proven very useful for color image inpainting. In th…

    Submitted 6 May, 2020; originally announced May 2020.

  21. arXiv:2004.10445  [pdf, other]

    math.OC eess.IV

    RESIRE: real space iterative reconstruction engine for Tomography

    Authors: Minh Pham, Yakun Yuan, Arjun Rana, Jianwei Miao, Stanley Osher

    Abstract: Tomography has made a revolutionary impact on diverse fields, ranging from macro-/mesoscopic scale studies in biology, radiology, plasma physics to the characterization of 3D atomic structure in material science. The fundamental of tomography is to reconstruct a 3D object from a set of 2D projections. To solve the tomography problem, many algorithms have been developed. Among them are methods usin…

    Submitted 25 April, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

  22. arXiv:1909.06567  [pdf, other]

    math.NA eess.IV

    Color image recovery using low-rank quaternion matrix completion algorithm

    Authors: Jifei Miao, Kit Ian Kou

    Abstract: As a new color image representation tool, quaternion has achieved excellent results in color image processing problems. In this paper, we propose a novel low-rank quaternion matrix completion algorithm to recover missing data of color image. Motivated by two kinds of low-rank approximation approaches (low-rank decomposition and nuclear norm minimization) in traditional matrix-based methods, we com…

    Submitted 14 September, 2019; originally announced September 2019.

  23. arXiv:1906.01875  [pdf, other]

    eess.IV math.OC

    A semi-implicit relaxed Douglas-Rachford algorithm (sir-DR) for Ptychography

    Authors: Minh Pham, Arjun Rana, Jianwei Miao, Stanley Osher

    Abstract: Alternating projection based methods, such as ePIE and rPIE, have been used widely in ptychography. However, they only work well if there are adequate measurements (diffraction patterns); in the case of sparse data (i.e. fewer measurements) alternating projection underperforms and might not even converge. In this paper, we propose semi-implicit relaxed Douglas Rachford (sir-DR), an accelerated ite…

    Submitted 5 June, 2019; originally announced June 2019.