Skip to main content

Showing 1–50 of 59 results for author: Shao, L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2503.12461  [pdf, other

    cs.CV eess.IV

    MambaIC: State Space Models for High-Performance Learned Image Compression

    Authors: Fanhu Zeng, Hao Tang, Yihua Shao, Siyu Chen, Ling Shao, Yan Wang

    Abstract: A high-performance image compression algorithm is crucial for real-time information transmission across numerous fields. Despite rapid progress in image compression, computational inefficiency and poor redundancy modeling still pose significant bottlenecks, limiting practical applications. Inspired by the effectiveness of state space models (SSMs) in capturing long-range dependencies, we leverage… ▽ More

    Submitted 19 March, 2025; v1 submitted 16 March, 2025; originally announced March 2025.

    Comments: Accepted to CVPR 2025

  2. arXiv:2503.00380  [pdf, other

    eess.SY

    Adaptive Wall-Following Control for Unmanned Ground Vehicles Using Spiking Neural Networks

    Authors: Hengye Yang, Yanxiao Chen, Zexuan Fan, Lin Shao, Tao Sun

    Abstract: Unmanned ground vehicles operating in complex environments must adaptively adjust to modeling uncertainties and external disturbances to perform tasks such as wall following and obstacle avoidance. This paper introduces an adaptive control approach based on spiking neural networks for wall fitting and tracking, which learns and adapts to unforeseen disturbances. We propose real-time wall-fitting a… ▽ More

    Submitted 1 March, 2025; originally announced March 2025.

  3. arXiv:2411.12278  [pdf, other

    eess.IV cs.CV

    Versatile Cataract Fundus Image Restoration Model Utilizing Unpaired Cataract and High-quality Images

    Authors: Zheng Gong, Zhuo Deng, Weihao Gao, Wenda Zhou, Yuhang Yang, Hanqing Zhao, Zhiyuan Niu, Lei Shao, Wenbin Wei, Lan Ma

    Abstract: Cataract is one of the most common blinding eye diseases and can be treated by surgery. However, because cataract patients may also suffer from other blinding eye diseases, ophthalmologists must diagnose them before surgery. The cloudy lens of cataract patients forms a hazy degeneration in the fundus images, making it challenging to observe the patient's fundus vessels, which brings difficulties t… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: 12 pages, 8 figures

  4. arXiv:2407.14198  [pdf

    cs.CV eess.IV

    Double-Shot 3D Shape Measurement with a Dual-Branch Network for Structured Light Projection Profilometry

    Authors: Mingyang Lei, Jingfan Fan, Long Shao, Hong Song, Deqiang Xiao, Danni Ai, Tianyu Fu, Ying Gu, Jian Yang

    Abstract: The structured light (SL)-based three-dimensional (3D) measurement techniques with deep learning have been widely studied to improve measurement efficiency, among which fringe projection profilometry (FPP) and speckle projection profilometry (SPP) are two popular methods. However, they generally use a single projection pattern for reconstruction, resulting in fringe order ambiguity or poor reconst… ▽ More

    Submitted 9 December, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

  5. arXiv:2407.04737  [pdf, other

    eess.SP cs.AI

    Hierarchical Decoupling Capacitor Optimization for Power Distribution Network of 2.5D ICs with Co-Analysis of Frequency and Time Domains Based on Deep Reinforcement Learning

    Authors: Yuanyuan Duan, Haiyang Feng, Zhiping Yu, Hanming Wu, Leilai Shao, Xiaolei Zhu

    Abstract: With the growing need for higher memory bandwidth and computation density, 2.5D design, which involves integrating multiple chiplets onto an interposer, emerges as a promising solution. However, this integration introduces significant challenges due to increasing data rates and a large number of I/Os, necessitating advanced optimization of the power distribution networks (PDNs) both on-chip and on… ▽ More

    Submitted 20 May, 2025; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: The data needs to be experimentally revalidated, and the experimental details require further optimization

  6. arXiv:2407.03245  [pdf, other

    cs.RO cs.AI eess.SY

    TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach

    Authors: Weikun Peng, Jun Lv, Yuwei Zeng, Haonan Chen, Siheng Zhao, Jichen Sun, Cewu Lu, Lin Shao

    Abstract: The tie-knotting task is highly challenging due to the tie's high deformation and long-horizon manipulation actions. This work presents TieBot, a Real-to-Sim-to-Real learning from visual demonstration system for the robots to learn to knot a tie. We introduce the Hierarchical Feature Matching approach to estimate a sequence of tie's meshes from the demonstration video. With these estimated meshes… ▽ More

    Submitted 19 October, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted by CoRL 2024 as Oral presentation, camera-ready version

  7. arXiv:2404.03179  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization

    Authors: Tiantian Geng, Teng Wang, Yanfu Zhang, Jinming Duan, Weili Guan, Feng Zheng, Ling shao

    Abstract: Video localization tasks aim to temporally locate specific instances in videos, including temporal action localization (TAL), sound event detection (SED) and audio-visual event localization (AVEL). Existing methods over-specialize on each task, overlooking the fact that these instances often occur in the same video to form the complete video content. In this work, we present UniAV, a Unified Audio… ▽ More

    Submitted 11 August, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  8. arXiv:2312.06454  [pdf, other

    eess.IV cs.CV cs.LG

    Point Transformer with Federated Learning for Predicting Breast Cancer HER2 Status from Hematoxylin and Eosin-Stained Whole Slide Images

    Authors: Bao Li, Zhenyu Liu, Lizhi Shao, Bensheng Qiu, Hong Bu, Jie Tian

    Abstract: Directly predicting human epidermal growth factor receptor 2 (HER2) status from widely available hematoxylin and eosin (HE)-stained whole slide images (WSIs) can reduce technical costs and expedite treatment selection. Accurately predicting HER2 requires large collections of multi-site WSIs. Federated learning enables collaborative training of these WSIs without gigabyte-size WSIs transportation a… ▽ More

    Submitted 27 February, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  9. Synergistic Perception and Control Simplex for Verifiable Safe Vertical Landing

    Authors: Ayoosh Bansal, Yang Zhao, James Zhu, Sheng Cheng, Yuliang Gu, Hyung-Jin Yoon, Hunmin Kim, Naira Hovakimyan, Lui Sha

    Abstract: Perception, Planning, and Control form the essential components of autonomy in advanced air mobility. This work advances the holistic integration of these components to enhance the performance and robustness of the complete cyber-physical system. We adapt Perception Simplex, a system for verifiable collision avoidance amidst obstacle detection faults, to the vertical landing maneuver for autonomou… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: To appear in AIAA SciTech 2024

    ACM Class: C.3; C.4; J.7

    Journal ref: AIAA SCITECH 2024 Forum, p. 1167

  10. arXiv:2309.04710  [pdf, other

    cs.RO cs.AI cs.CV cs.GR eess.SY

    Jade: A Differentiable Physics Engine for Articulated Rigid Bodies with Intersection-Free Frictional Contact

    Authors: Gang Yang, Siyuan Luo, Lin Shao

    Abstract: We present Jade, a differentiable physics engine for articulated rigid bodies. Jade models contacts as the Linear Complementarity Problem (LCP). Compared to existing differentiable simulations, Jade offers features including intersection-free collision simulation and stable LCP solutions for multiple frictional contacts. We use continuous collision detection to detect the time of impact and adopt… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

  11. arXiv:2306.06102  [pdf, other

    eess.SY

    Backup Plan Constrained Model Predictive Control with Guaranteed Stability

    Authors: Ran Tao, Hunmin Kim, Hyung-Jin Yoon, Wenbin Wan, Naira Hovakimyan, Lui Sha, Petros Voulgaris

    Abstract: This article proposes and evaluates a new safety concept called backup plan safety for path planning of autonomous vehicles under mission uncertainty using model predictive control (MPC). Backup plan safety is defined as the ability to complete an alternative mission when the primary mission is aborted. To include this new safety concept in control problems, we formulate a feasibility maximization… ▽ More

    Submitted 6 October, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  12. arXiv:2303.16860  [pdf, other

    cs.LG eess.SY

    Physical Deep Reinforcement Learning Towards Safety Guarantee

    Authors: Hongpeng Cao, Yanbing Mao, Lui Sha, Marco Caccamo

    Abstract: Deep reinforcement learning (DRL) has achieved tremendous success in many complex decision-making tasks of autonomous systems with high-dimensional state and/or action spaces. However, the safety and stability still remain major concerns that hinder the applications of DRL to safety-critical autonomous systems. To address the concerns, we proposed the Phy-DRL: a physical deep reinforcement learnin… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: Working Paper

  13. arXiv:2212.14735  [pdf

    eess.SP

    Compressed domain vibration detection and classification for distributed acoustic sensing

    Authors: Xingliang Shen, Huan Wu, Kun Zhu, Yujia Li, Hua Zheng, Jialong Li, Liyang Shao, Perry Ping Shum, Chao Lu

    Abstract: Distributed acoustic sensing (DAS) is a novel enabling technology that can turn existing fibre optic networks to distributed acoustic sensors. However, it faces the challenges of transmitting, storing, and processing massive streams of data which are orders of magnitude larger than that collected from point sensors. The gap between intensive data generated by DAS and modern computing system with l… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

  14. arXiv:2209.01710  [pdf, other

    cs.RO cs.LG eess.SY

    Perception Simplex: Verifiable Collision Avoidance in Autonomous Vehicles Amidst Obstacle Detection Faults

    Authors: Ayoosh Bansal, Hunmin Kim, Simon Yu, Bo Li, Naira Hovakimyan, Marco Caccamo, Lui Sha

    Abstract: Advances in deep learning have revolutionized cyber-physical applications, including the development of Autonomous Vehicles. However, real-world collisions involving autonomous control of vehicles have raised significant safety concerns regarding the use of Deep Neural Networks (DNN) in safety-critical tasks, particularly Perception. The inherent unverifiability of DNNs poses a key challenge in en… ▽ More

    Submitted 28 November, 2023; v1 submitted 4 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.14403

    ACM Class: D.2.11; I.2.9; C.4; J.7

    Journal ref: Software Testing, Verification and Reliability. 2024. e1879

  15. Verifiable Obstacle Detection

    Authors: Ayoosh Bansal, Hunmin Kim, Simon Yu, Bo Li, Naira Hovakimyan, Marco Caccamo, Lui Sha

    Abstract: Perception of obstacles remains a critical safety concern for autonomous vehicles. Real-world collisions have shown that the autonomy faults leading to fatal collisions originate from obstacle existence detection. Open source autonomous driving implementations show a perception pipeline with complex interdependent Deep Neural Networks. These networks are not fully verifiable, making them unsuitabl… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted at ISSRE 2022

    ACM Class: D.2.4; I.2.9; I.4.8

    Journal ref: 33rd International Symposium on Software Reliability Engineering (ISSRE), pp. 61-72. IEEE, 2022

  16. arXiv:2205.01649  [pdf, other

    eess.IV cs.CV

    Learning Enriched Features for Fast Image Restoration and Enhancement

    Authors: Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao

    Abstract: Given a degraded input image, image restoration aims to recover the missing high-quality image content. Numerous applications demand effective image restoration, e.g., computational photography, surveillance, autonomous vehicles, and remote sensing. Significant advances in image restoration have been made in recent years, dominated by convolutional neural networks (CNNs). The widely-used CNN-based… ▽ More

    Submitted 19 April, 2022; originally announced May 2022.

    Comments: This article supersedes arXiv:2003.06792. Accepted for publication in TPAMI

  17. arXiv:2112.05752  [pdf, other

    eess.IV cs.CV

    Specificity-Preserving Federated Learning for MR Image Reconstruction

    Authors: Chun-Mei Feng, Yunlu Yan, Shanshan Wang, Yong Xu, Ling Shao, Huazhu Fu

    Abstract: Federated learning (FL) can be used to improve data privacy and efficiency in magnetic resonance (MR) image reconstruction by enabling multiple institutions to collaborate without needing to aggregate local data. However, the domain shift caused by different MR imaging protocols can substantially degrade the performance of FL models. Recent FL techniques tend to solve this by enhancing the general… ▽ More

    Submitted 22 August, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: 12 pages, 8 figures Code: https://github.com/chunmeifeng/FedMRI

    Journal ref: IEEE Transactions on Medical Imaging, 2022

  18. arXiv:2110.08080  [pdf, other

    eess.IV cs.CV

    Deep multi-modal aggregation network for MR image reconstruction with auxiliary modality

    Authors: Chun-Mei Feng, Huazhu Fu, Tianfei Zhou, Yong Xu, Ling Shao, David Zhang

    Abstract: Magnetic resonance (MR) imaging produces detailed images of organs and tissues with better contrast, but it suffers from a long acquisition time, which makes the image quality vulnerable to say motion artifacts. Recently, many approaches have been developed to reconstruct full-sampled images from partially observed measurements to accelerate MR imaging. However, most approaches focused on reconstr… ▽ More

    Submitted 21 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  19. arXiv:2109.01664  [pdf, other

    eess.IV cs.CV

    Exploring Separable Attention for Multi-Contrast MR Image Super-Resolution

    Authors: Chun-Mei Feng, Yunlu Yan, Kai Yu, Yong Xu, Ling Shao, Huazhu Fu

    Abstract: Super-resolving the Magnetic Resonance (MR) image of a target contrast under the guidance of the corresponding auxiliary contrast, which provides additional anatomical information, is a new and effective solution for fast MR imaging. However, current multi-contrast super-resolution (SR) methods tend to concatenate different contrasts directly, ignoring their relationships in different clues, e.g.,… ▽ More

    Submitted 21 August, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: text overlap with arXiv:2105.08949 https://github.com/chunmeifeng/SANet

  20. Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers

    Authors: Bo Dong, Wenhai Wang, Deng-Ping Fan, Jinpeng Li, Huazhu Fu, Ling Shao

    Abstract: Most polyp segmentation methods use CNNs as their backbone, leading to two key issues when exchanging information between the encoder and decoder: 1) taking into account the differences in contribution between different-level features and 2) designing an effective mechanism for fusing these features. Unlike existing CNN-based methods, we adopt a transformer encoder, which learns more powerful and… ▽ More

    Submitted 19 February, 2024; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Accepted to CAAI AIR 2023

    Journal ref: CAAI Artificial Intelligence Research, 2023, 2: 9150015

  21. arXiv:2107.07314  [pdf, other

    cs.CV cs.LG eess.IV

    Variational Topic Inference for Chest X-Ray Report Generation

    Authors: Ivona Najdenkoska, Xiantong Zhen, Marcel Worring, Ling Shao

    Abstract: Automating report generation for medical imaging promises to reduce workload and assist diagnosis in clinical practice. Recent work has shown that deep learning models can successfully caption natural images. However, learning from medical data is challenging due to the diversity and uncertainty inherent in the reports written by different radiologists with discrepant expertise and experience. To… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: To be published in the International Conference on Medical Image Computing and Computer Assisted Intervention 2021

  22. arXiv:2106.14248  [pdf, other

    eess.IV cs.CV

    Multi-Modal Transformer for Accelerated MR Imaging

    Authors: Chun-Mei Feng, Yunlu Yan, Geng Chen, Yong Xu, Ling Shao, Huazhu Fu

    Abstract: Accelerated multi-modal magnetic resonance (MR) imaging is a new and effective solution for fast MR imaging, providing superior performance in restoring the target modality from its undersampled counterpart with guidance from an auxiliary modality. However, existing works simply combine the auxiliary modality as prior information, lacking in-depth investigations on the potential mechanisms for fus… ▽ More

    Submitted 11 May, 2022; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: https://github.com/chunmeifeng/MTrans

  23. arXiv:2105.05980  [pdf, other

    eess.IV cs.CV

    DONet: Dual-Octave Network for Fast MR Image Reconstruction

    Authors: Chun-Mei Feng, Zhanyuan Yang, Huazhu Fu, Yong Xu, Jian Yang, Ling Shao

    Abstract: Magnetic resonance (MR) image acquisition is an inherently prolonged process, whose acceleration has long been the subject of research. This is commonly achieved by obtaining multiple undersampled images, simultaneously, through parallel imaging. In this paper, we propose the Dual-Octave Network (DONet), which is capable of learning multi-scale spatial-frequency features from both the real and ima… ▽ More

    Submitted 12 June, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2104.05345

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2021

  24. arXiv:2104.05345  [pdf, other

    eess.IV cs.CV

    Dual-Octave Convolution for Accelerated Parallel MR Image Reconstruction

    Authors: Chun-Mei Feng, Zhanyuan Yang, Geng Chen, Yong Xu, Ling Shao

    Abstract: Magnetic resonance (MR) image acquisition is an inherently prolonged process, whose acceleration by obtaining multiple undersampled images simultaneously through parallel imaging has always been the subject of research. In this paper, we propose the Dual-Octave Convolution (Dual-OctConv), which is capable of learning multi-scale spatial-frequency features from both real and imaginary components, f… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI) 2021

    Journal ref: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI) 2021

  25. arXiv:2103.14819  [pdf, other

    eess.SY

    Backup Plan Constrained Model Predictive Control

    Authors: Hunmin Kim, Hyungjin Yoon, Wenbin Wan, Naira Hovakimyan, Lui Sha, Petros Voulgaris

    Abstract: This article proposes a new safety concept: backup plan safety. The backup plan safety is defined as the ability to complete one of the alternative missions in the case of primary mission abortion. To incorporate this new safety concept in control problems, we formulate a feasibility maximization problem that adopts additional (virtual) input horizons toward the alternative missions on top of the… ▽ More

    Submitted 27 March, 2021; originally announced March 2021.

  26. arXiv:2103.11587  [pdf, other

    cs.CV eess.IV

    Brain Image Synthesis with Unsupervised Multivariate Canonical CSC$\ell_4$Net

    Authors: Yawen Huang, Feng Zheng, Danyang Wang, Weilin Huang, Matthew R. Scott, Ling Shao

    Abstract: Recent advances in neuroscience have highlighted the effectiveness of multi-modal medical data for investigating certain pathologies and understanding human cognition. However, obtaining full sets of different modalities is limited by various factors, such as long acquisition times, high examination costs and artifact suppression. In addition, the complexity, high dimensionality and heterogeneity… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: 10 pages, 5 figures CVPR2021 oral

  27. arXiv:2103.10825  [pdf, other

    eess.IV cs.CV

    Variational Knowledge Distillation for Disease Classification in Chest X-Rays

    Authors: Tom van Sonsbeek, Xiantong Zhen, Marcel Worring, Ling Shao

    Abstract: Disease classification relying solely on imaging data attracts great interest in medical image analysis. Current models could be further improved, however, by also employing Electronic Health Records (EHRs), which contain rich information on patients and findings from clinicians. It is challenging to incorporate this information into disease classification due to the high reliance on clinician inp… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

  28. arXiv:2012.02776  [pdf, other

    cs.CV cs.LG eess.IV

    Learning to Fuse Asymmetric Feature Maps in Siamese Trackers

    Authors: Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen

    Abstract: Recently, Siamese-based trackers have achieved promising performance in visual tracking. Most recent Siamese-based trackers typically employ a depth-wise cross-correlation (DW-XCorr) to obtain multi-channel correlation information from the two feature maps (target and search region). However, DW-XCorr has several limitations within Siamese-based tracking: it can easily be fooled by distractors, ha… ▽ More

    Submitted 30 March, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: Accepted by CVPR2021

  29. arXiv:2010.06616  [pdf, ps, other

    eess.SY

    Finite-Time Model Inference From A Single Noisy Trajectory

    Authors: Yanbing Mao, Naira Hovakimyan, Petros Voulgaris, Lui Sha

    Abstract: This paper proposes a novel model inference procedure to identify system matrix from a single noisy trajectory over a finite-time interval. The proposed inference procedure comprises an observation data processor, a redundant data processor and an ordinary least-square estimator, wherein the data processors mitigate the influence of observation noise on inference error. We first systematically inv… ▽ More

    Submitted 1 January, 2021; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: Submitted

  30. arXiv:2009.12349  [pdf, other

    eess.SY

    Robust Vehicle Lane Keeping Control with Networked Proactive Adaptation

    Authors: Hunmin Kim, Wenbin Wan, Naira Hovakimyan, Lui Sha, Petros Voulgaris

    Abstract: Road condition is an important environmental factor for autonomous vehicle control. A dramatic change in the road condition from the nominal status is a source of uncertainty that can lead to a system failure. Once the vehicle encounters an uncertain environment, such as hitting an ice patch, it is too late to reduce the speed, and the vehicle can lose control. To cope with future uncertainties in… ▽ More

    Submitted 28 September, 2020; v1 submitted 25 September, 2020; originally announced September 2020.

  31. arXiv:2009.08973  [pdf, other

    cs.LG cs.AI cs.RO eess.SY stat.ML

    GRAC: Self-Guided and Self-Regularized Actor-Critic

    Authors: Lin Shao, Yifan You, Mengyuan Yan, Qingyun Sun, Jeannette Bohg

    Abstract: Deep reinforcement learning (DRL) algorithms have successfully been demonstrated on a range of challenging decision making and control tasks. One dominant component of recent deep reinforcement learning algorithms is the target network which mitigates the divergence when learning the Q function. However, target networks can slow down the learning process due to delayed function updates. Our main c… ▽ More

    Submitted 10 November, 2020; v1 submitted 18 September, 2020; originally announced September 2020.

  32. arXiv:2008.02101  [pdf, other

    eess.IV cs.CV

    Structure Preserving Stain Normalization of Histopathology Images Using Self-Supervised Semantic Guidance

    Authors: Dwarikanath Mahapatra, Behzad Bozorgtabar, Jean-Philippe Thiran, Ling Shao

    Abstract: Although generative adversarial network (GAN) based style transfer is state of the art in histopathology color-stain normalization, they do not explicitly integrate structural information of tissues. We propose a self-supervised approach to incorporate semantic guidance into a GAN based stain normalization framework and preserve detailed structural information. Our method does not require manual s… ▽ More

    Submitted 3 June, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

  33. arXiv:2008.01627  [pdf, ps, other

    eess.SY

    SL1-Simplex: Safe Velocity Regulation of Self-Driving Vehicles in Dynamic and Unforeseen Environments

    Authors: Yanbing Mao, Yuliang Gu, Naira Hovakimyan, Lui Sha, Petros Voulgaris

    Abstract: This paper proposes a novel extension of the Simplex architecture with model switching and model learning to achieve safe velocity regulation of self-driving vehicles in dynamic and unforeseen environments. To guarantee the reliability of autonomous vehicles, an $\mathcal{L}_{1}$ adaptive controller that compensates for uncertainties and disturbances is employed by the Simplex architecture as a ve… ▽ More

    Submitted 1 February, 2022; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: Submitted to ACM Transactions on Cyber-Physical Systems

  34. arXiv:2006.11538  [pdf, other

    cs.CV cs.LG eess.IV

    Pyramidal Convolution: Rethinking Convolutional Neural Networks for Visual Recognition

    Authors: Ionut Cosmin Duta, Li Liu, Fan Zhu, Ling Shao

    Abstract: This work introduces pyramidal convolution (PyConv), which is capable of processing the input at multiple filter scales. PyConv contains a pyramid of kernels, where each level involves different types of filters with varying size and depth, which are able to capture different levels of details in the scene. On top of these improved recognition capabilities, PyConv is also efficient and, with our f… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

  35. arXiv:2006.11392  [pdf, other

    eess.IV cs.CV

    PraNet: Parallel Reverse Attention Network for Polyp Segmentation

    Authors: Deng-Ping Fan, Ge-Peng Ji, Tao Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, Ling Shao

    Abstract: Colonoscopy is an effective technique for detecting colorectal polyps, which are highly related to colorectal cancer. In clinical practice, segmenting polyps from colonoscopy images is of great importance since it provides valuable information for diagnosis and surgery. However, accurate polyp segmentation is a challenging task, for two major reasons: (i) the same type of polyps has a diversity of… ▽ More

    Submitted 3 July, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: Accepted to MICCAI 2020

  36. arXiv:2006.10135  [pdf, other

    eess.IV cs.CV cs.LG

    M2Net: Multi-modal Multi-channel Network for Overall Survival Time Prediction of Brain Tumor Patients

    Authors: Tao Zhou, Huazhu Fu, Yu Zhang, Changqing Zhang, Xiankai Lu, Jianbing Shen, Ling Shao

    Abstract: Early and accurate prediction of overall survival (OS) time can help to obtain better treatment planning for brain tumor patients. Although many OS time prediction methods have been developed and obtain promising results, there are still several issues. First, conventional prediction methods rely on radiomic features at the local lesion area of a magnetic resonance (MR) volume, which may not repre… ▽ More

    Submitted 14 July, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: Accepted by MICCAI'20

  37. arXiv:2005.07697  [pdf, other

    eess.SY cs.MA

    Safety Constrained Multi-UAV Time Coordination: A Bi-level Control Framework in GPS Denied Environment

    Authors: Wenbin Wan, Hunmin Kim, Yikun Cheng, Naira Hovakimyan, Petros G. Voulgaris, Lui Sha

    Abstract: Unmanned aerial vehicles (UAVs) suffer from sensor drifts in GPS denied environments, which can cause safety issues. To avoid intolerable sensor drifts while completing the time-critical coordination task for multi-UAV systems, we propose a safety constrained bi-level control framework. The first level is the time-critical coordination level that achieves a consensus of coordination states and pro… ▽ More

    Submitted 19 May, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.10826

  38. arXiv:2005.05594  [pdf, other

    eess.IV cs.CV

    Modeling and Enhancing Low-quality Retinal Fundus Images

    Authors: Ziyi Shen, Huazhu Fu, Jianbing Shen, Ling Shao

    Abstract: Retinal fundus images are widely used for the clinical screening and diagnosis of eye diseases. However, fundus images captured by operators with various levels of experience have a large variation in quality. Low-quality fundus images increase uncertainty in clinical observation and lead to the risk of misdiagnosis. However, due to the special optical beam of fundus imaging and structure of the r… ▽ More

    Submitted 9 December, 2020; v1 submitted 12 May, 2020; originally announced May 2020.

  39. arXiv:2004.14133  [pdf, other

    eess.IV cs.CV cs.LG

    Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images

    Authors: Deng-Ping Fan, Tao Zhou, Ge-Peng Ji, Yi Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, Ling Shao

    Abstract: Coronavirus Disease 2019 (COVID-19) spread globally in early 2020, causing the world to face an existential health crisis. Automated detection of lung infections from computed tomography (CT) images offers a great potential to augment the traditional healthcare strategy for tackling COVID-19. However, segmenting infected regions from CT slices faces several challenges, including high variation in… ▽ More

    Submitted 21 May, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: To appear in IEEE TMI. The code is released in: https://github.com/DengPingFan/Inf-Net

  40. arXiv:2004.08499  [pdf, other

    cs.RO cs.LG eess.SY

    Design and Control of Roller Grasper V2 for In-Hand Manipulation

    Authors: Shenli Yuan, Lin Shao, Connor L. Yako, Alex Gruebele, J. Kenneth Salisbury

    Abstract: The ability to perform in-hand manipulation still remains an unsolved problem; having this capability would allow robots to perform sophisticated tasks requiring repositioning and reorienting of grasped objects. In this work, we present a novel non-anthropomorphic robot grasper with the ability to manipulate objects by means of active surfaces at the fingertips. Active surfaces are achieved by sph… ▽ More

    Submitted 17 November, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) October 25-29, 2020, Las Vegas, NV, USA (Virtual)

  41. arXiv:2004.04491  [pdf, other

    cs.CV cs.LG eess.IV

    Multi-Granularity Canonical Appearance Pooling for Remote Sensing Scene Classification

    Authors: S. Wang, Y. Guan, L. Shao

    Abstract: Recognising remote sensing scene images remains challenging due to large visual-semantic discrepancies. These mainly arise due to the lack of detailed annotations that can be employed to align pixel-level representations with high-level semantic labels. As the tagging process is labour-intensive and subjective, we hereby propose a novel Multi-Granularity Canonical Appearance Pooling (MG-CAP) to au… ▽ More

    Submitted 9 April, 2020; originally announced April 2020.

    Comments: This paper is going to be published by IEEE Transactions on Image Processing

    Journal ref: IEEE Transactions on Image Processing 29, 5396--5407 (2020)

  42. arXiv:2003.14119  [pdf, other

    eess.IV cs.CV

    Pathological Retinal Region Segmentation From OCT Images Using Geometric Relation Based Augmentation

    Authors: Dwarikanath Mahapatra, Behzad Bozorgtabar, Jean-Philippe Thiran, Ling Shao

    Abstract: Medical image segmentation is an important task for computer aided diagnosis. Pixelwise manual annotations of large datasets require high expertise and is time consuming. Conventional data augmentations have limited benefit by not fully representing the underlying distribution of the training set, thus affecting model robustness when tested on images captured from different sources. Prior work lev… ▽ More

    Submitted 25 April, 2020; v1 submitted 31 March, 2020; originally announced March 2020.

  43. arXiv:2003.07761  [pdf, other

    eess.IV cs.CV

    CycleISP: Real Image Restoration via Improved Data Synthesis

    Authors: Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao

    Abstract: The availability of large-scale datasets has helped unleash the true potential of deep convolutional neural networks (CNNs). However, for the single-image denoising problem, capturing a real dataset is an unacceptably expensive and cumbersome procedure. Consequently, image denoising algorithms are mostly developed and evaluated on synthetic data that is usually generated with a widespread assumpti… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: CVPR 2020 (Oral)

  44. arXiv:2003.04253  [pdf, other

    cs.CV cs.LG eess.IV

    Motion-Attentive Transition for Zero-Shot Video Object Segmentation

    Authors: Tianfei Zhou, Shunzhou Wang, Yi Zhou, Yazhou Yao, Jianwu Li, Ling Shao

    Abstract: In this paper, we present a novel Motion-Attentive Transition Network (MATNet) for zero-shot video object segmentation, which provides a new way of leveraging motion information to reinforce spatio-temporal object representation. An asymmetric attention block, called Motion-Attentive Transition (MAT), is designed within a two-stream encoder, which transforms appearance features into motion-attenti… ▽ More

    Submitted 9 July, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: AAAI 2020. Code: https://github.com/tfzhou/MATNet

  45. arXiv:2002.05000  [pdf, other

    cs.CV eess.IV

    Hi-Net: Hybrid-fusion Network for Multi-modal MR Image Synthesis

    Authors: Tao Zhou, Huazhu Fu, Geng Chen, Jianbing Shen, Ling Shao

    Abstract: Magnetic resonance imaging (MRI) is a widely used neuroimaging technique that can provide images of different contrasts (i.e., modalities). Fusing this multi-modal data has proven particularly effective for boosting model performance in many tasks. However, due to poor data quality and frequent patient dropout, collecting all modalities for every patient remains a challenge. Medical image synthesi… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: has been accepted by IEEE TMI

  46. DR-GAN: Conditional Generative Adversarial Network for Fine-Grained Lesion Synthesis on Diabetic Retinopathy Images

    Authors: Yi Zhou, Boyang Wang, Xiaodong He, Shanshan Cui, Ling Shao

    Abstract: Diabetic retinopathy (DR) is a complication of diabetes that severely affects eyes. It can be graded into five levels of severity according to international protocol. However, optimizing a grading model to have strong generalizability requires a large amount of balanced training data, which is difficult to collect particularly for the high severity levels. Typical data augmentation methods, includ… ▽ More

    Submitted 11 November, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Extension work of our MICCAI paper

    Journal ref: IEEE Journal of Biomedical and Health Informatics 2020

  47. arXiv:1911.04470  [pdf, other

    cs.CV cs.LG eess.IV

    Semi-Heterogeneous Three-Way Joint Embedding Network for Sketch-Based Image Retrieval

    Authors: Jianjun Lei, Yuxin Song, Bo Peng, Zhanyu Ma, Ling Shao, Yi-Zhe Song

    Abstract: Sketch-based image retrieval (SBIR) is a challenging task due to the large cross-domain gap between sketches and natural images. How to align abstract sketches and natural images into a common high-level semantic space remains a key problem in SBIR. In this paper, we propose a novel semi-heterogeneous three-way joint embedding network (Semi3-Net), which integrates three branches (a sketch branch,… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology

  48. arXiv:1911.00969  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Learning to Scaffold the Development of Robotic Manipulation Skills

    Authors: Lin Shao, Toki Migimatsu, Jeannette Bohg

    Abstract: Learning contact-rich, robotic manipulation skills is a challenging problem due to the high-dimensionality of the state and action space as well as uncertainty from noisy sensors and inaccurate motor control. To combat these factors and achieve more robust manipulation, humans actively exploit contact constraints in the environment. By adopting a similar strategy, robots can also achieve more robu… ▽ More

    Submitted 5 October, 2020; v1 submitted 3 November, 2019; originally announced November 2019.

    Comments: Accepted to IEEE International Conference on Robotics and Automation (ICRA) 2020

  49. arXiv:1910.10826  [pdf, other

    eess.SY

    A Safety Constrained Control Framework for UAVs in GPS Denied Environment

    Authors: Wenbin Wan, Hunmin Kim, Naira Hovakimyan, Lui Sha, Petros G. Voulgaris

    Abstract: Unmanned aerial vehicles (UAVs) suffer from sensor drifts in GPS denied environments, which can lead to potentially dangerous situations. To avoid intolerable sensor drifts in the presence of GPS spoofing attacks, we propose a safety constrained control framework that adapts the UAV at a path re-planning level to support resilient state estimation against GPS spoofing attacks. The attack detector… ▽ More

    Submitted 12 April, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

  50. arXiv:1909.03749  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Learning Visual Dynamics Models of Rigid Objects using Relational Inductive Biases

    Authors: Fabio Ferreira, Lin Shao, Tamim Asfour, Jeannette Bohg

    Abstract: Endowing robots with human-like physical reasoning abilities remains challenging. We argue that existing methods often disregard spatio-temporal relations and by using Graph Neural Networks (GNNs) that incorporate a relational inductive bias, we can shift the learning process towards exploiting relations. In this work, we learn action-conditional forward dynamics models of a simulated manipulation… ▽ More

    Submitted 23 October, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: short paper (4 pages, two figures), accepted to NeurIPS 2019 Graph Representation Learning workshop