Skip to main content

Showing 1–39 of 39 results for author: Sheng, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.18575  [pdf, ps, other

    cs.CV

    2D Triangle Splatting for Direct Differentiable Mesh Training

    Authors: Kaifeng Sheng, Zheng Zhou, Yingliang Peng, Qianwei Wang

    Abstract: Differentiable rendering with 3D Gaussian primitives has emerged as a powerful method for reconstructing high-fidelity 3D scenes from multi-view images. While it offers improvements over NeRF-based methods, this representation still encounters challenges with rendering speed and advanced rendering effects, such as relighting and shadow rendering, compared to mesh-based models. In this paper, we pr… ▽ More

    Submitted 26 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

    Comments: 13 pages, 8 figures

  2. arXiv:2506.03478  [pdf, ps, other

    cs.GR cs.CV

    Facial Appearance Capture at Home with Patch-Level Reflectance Prior

    Authors: Yuxuan Han, Junfeng Lyu, Kuan Sheng, Minghao Que, Qixuan Zhang, Lan Xu, Feng Xu

    Abstract: Existing facial appearance capture methods can reconstruct plausible facial reflectance from smartphone-recorded videos. However, the reconstruction quality is still far behind the ones based on studio recordings. This paper fills the gap by developing a novel daily-used solution with a co-located smartphone and flashlight video capture setting in a dim room. To enhance the quality, our key observ… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: ACM Transactions on Graphics (Proc. of SIGGRAPH), 2025. Code: https://github.com/yxuhan/DoRA; Project Page: https://yxuhan.github.io/DoRA

  3. arXiv:2505.19239  [pdf, ps, other

    cs.CV

    DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving

    Authors: Chen Shi, Shaoshuai Shi, Kehua Sheng, Bo Zhang, Li Jiang

    Abstract: Data-driven learning has advanced autonomous driving, yet task-specific models struggle with out-of-distribution scenarios due to their narrow optimization objectives and reliance on costly annotated data. We present DriveX, a self-supervised world model that learns generalizable scene dynamics and holistic representations (geometric, semantic, and motion) from large-scale driving videos. DriveX i… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  4. arXiv:2503.05088  [pdf, other

    cs.RO

    An End-to-End Learning-Based Multi-Sensor Fusion for Autonomous Vehicle Localization

    Authors: Changhong Lin, Jiarong Lin, Zhiqiang Sui, XiaoZhi Qu, Rui Wang, Kehua Sheng, Bo Zhang

    Abstract: Multi-sensor fusion is essential for autonomous vehicle localization, as it is capable of integrating data from various sources for enhanced accuracy and reliability. The accuracy of the integrated location and orientation depends on the precision of the uncertainty modeling. Traditional methods of uncertainty modeling typically assume a Gaussian distribution and involve manual heuristic parameter… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: 7 pages, 8 figures, to be published in ICRA2025

  5. arXiv:2503.05051  [pdf

    eess.IV cs.AI cs.CV

    Accelerated Patient-specific Non-Cartesian MRI Reconstruction using Implicit Neural Representations

    Authors: Di Xu, Hengjie Liu, Xin Miao, Daniel O'Connor, Jessica E. Scholey, Wensha Yang, Mary Feng, Michael Ohliger, Hui Lin, Dan Ruan, Yang Yang, Ke Sheng

    Abstract: The scanning time for a fully sampled MRI can be undesirably lengthy. Compressed sensing has been developed to minimize image artifacts in accelerated scans, but the required iterative reconstruction is computationally complex and difficult to generalize on new cases. Image-domain-based deep learning methods (e.g., convolutional neural networks) emerged as a faster alternative but face challenges… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  6. arXiv:2502.19771  [pdf, other

    cs.CY cs.AI

    The erasure of intensive livestock farming in text-to-image generative AI

    Authors: Kehan Sheng, Frank A. M. Tuyttens, Marina A. G. von Keyserlingk

    Abstract: Generative AI (e.g., ChatGPT) is increasingly integrated into people's daily lives. While it is known that AI perpetuates biases against marginalized human groups, their impact on non-human animals remains understudied. We found that ChatGPT's text-to-image model (DALL-E 3) introduces a strong bias toward romanticizing livestock farming as dairy cows on pasture and pigs rooting in mud. This bias r… ▽ More

    Submitted 12 March, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

  7. arXiv:2412.10629  [pdf

    eess.IV cs.AI cs.CV

    Rapid Reconstruction of Extremely Accelerated Liver 4D MRI via Chained Iterative Refinement

    Authors: Di Xu, Xin Miao, Hengjie Liu, Jessica E. Scholey, Wensha Yang, Mary Feng, Michael Ohliger, Hui Lin, Yi Lao, Yang Yang, Ke Sheng

    Abstract: Abstract Purpose: High-quality 4D MRI requires an impractically long scanning time for dense k-space signal acquisition covering all respiratory phases. Accelerated sparse sampling followed by reconstruction enhancement is desired but often results in degraded image quality and long reconstruction time. We hereby propose the chained iterative reconstruction network (CIRNet) for efficient sparse-sa… ▽ More

    Submitted 13 December, 2024; originally announced December 2024.

  8. arXiv:2411.08158  [pdf

    eess.IV cs.CV

    TomoGRAF: A Robust and Generalizable Reconstruction Network for Single-View Computed Tomography

    Authors: Di Xu, Yang Yang, Hengjie Liu, Qihui Lyu, Martina Descovich, Dan Ruan, Ke Sheng

    Abstract: Computed tomography (CT) provides high spatial resolution visualization of 3D structures for scientific and clinical applications. Traditional analytical/iterative CT reconstruction algorithms require hundreds of angular data samplings, a condition that may not be met in practice due to physical and mechanical limitations. Sparse view CT reconstruction has been proposed using constrained optimizat… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

  9. arXiv:2410.05665  [pdf, other

    cs.CV

    Edge-Cloud Collaborative Satellite Image Analysis for Efficient Man-Made Structure Recognition

    Authors: Kaicheng Sheng, Junxiao Xue, Hui Zhang

    Abstract: The increasing availability of high-resolution satellite imagery has created immense opportunities for various applications. However, processing and analyzing such vast amounts of data in a timely and accurate manner poses significant challenges. The paper presents a new satellite image processing architecture combining edge and cloud computing to better identify man-made structures against natura… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

  10. arXiv:2405.12357  [pdf

    eess.IV cs.CV

    Paired Conditional Generative Adversarial Network for Highly Accelerated Liver 4D MRI

    Authors: Di Xu, Xin Miao, Hengjie Liu, Jessica E. Scholey, Wensha Yang, Mary Feng, Michael Ohliger, Hui Lin, Yi Lao, Yang Yang, Ke Sheng

    Abstract: Purpose: 4D MRI with high spatiotemporal resolution is desired for image-guided liver radiotherapy. Acquiring densely sampling k-space data is time-consuming. Accelerated acquisition with sparse samples is desirable but often causes degraded image quality or long reconstruction time. We propose the Reconstruct Paired Conditional Generative Adversarial Network (Re-Con-GAN) to shorten the 4D MRI rec… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  11. arXiv:2401.06893  [pdf, other

    eess.IV cs.CV

    Local Gamma Augmentation for Ischemic Stroke Lesion Segmentation on MRI

    Authors: Jon Middleton, Marko Bauer, Kaining Sheng, Jacob Johansen, Mathias Perslev, Silvia Ingala, Mads Nielsen, Akshay Pai

    Abstract: The identification and localisation of pathological tissues in medical images continues to command much attention among deep learning practitioners. When trained on abundant datasets, deep neural networks can match or exceed human performance. However, the scarcity of annotated data complicates the training of these models. Data augmentation techniques can compensate for a lack of training samples… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Camera-ready version for Northern Lights Deep Learning Conference 2024, 7 pages, 2 figures

  12. arXiv:2401.02192  [pdf

    eess.IV cs.CV cs.LG

    Nodule detection and generation on chest X-rays: NODE21 Challenge

    Authors: Ecem Sogancioglu, Bram van Ginneken, Finn Behrendt, Marcel Bengs, Alexander Schlaefer, Miron Radu, Di Xu, Ke Sheng, Fabien Scalzo, Eric Marcus, Samuele Papa, Jonas Teuwen, Ernst Th. Scholten, Steven Schalekamp, Nils Hendrix, Colin Jacobs, Ward Hendrix, Clara I Sánchez, Keelin Murphy

    Abstract: Pulmonary nodules may be an early manifestation of lung cancer, the leading cause of cancer-related deaths among both men and women. Numerous studies have established that deep learning methods can yield high-performance levels in the detection of lung nodules in chest X-rays. However, the lack of gold-standard public datasets slows down the progression of the research and prevents benchmarking of… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 15 pages, 5 figures

  13. arXiv:2312.07221  [pdf, other

    cs.CV cs.AI

    Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation

    Authors: Yuanbin Wang, Shaofei Huang, Yulu Gao, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Si Liu

    Abstract: Traditional 3D segmentation methods can only recognize a fixed range of classes that appear in the training set, which limits their application in real-world scenarios due to the lack of generalization ability. Large-scale visual-language pre-trained models, such as CLIP, have shown their generalization ability in the zero-shot 2D vision tasks, but are still unable to be applied to 3D semantic seg… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  14. arXiv:2311.02880  [pdf, other

    cs.LG

    MultiSPANS: A Multi-range Spatial-Temporal Transformer Network for Traffic Forecast via Structural Entropy Optimization

    Authors: Dongcheng Zou, Senzhang Wang, Xuefeng Li, Hao Peng, Yuandong Wang, Chunyang Liu, Kehua Sheng, Bo Zhang

    Abstract: Traffic forecasting is a complex multivariate time-series regression task of paramount importance for traffic management and planning. However, existing approaches often struggle to model complex multi-range dependencies using local spatiotemporal features and road network hierarchical knowledge. To address this, we propose MultiSPANS. First, considering that an individual recording point cannot r… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 10 pages, 7 figures, conference. The work has been accepted by WSDM2024

  15. arXiv:2309.10230  [pdf, other

    cs.CV

    LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data

    Authors: Shaocong Xu, Pengfei Li, Qianpu Sun, Xinyu Liu, Yang Li, Shihui Guo, Zhen Wang, Bo Jiang, Rui Wang, Kehua Sheng, Bo Zhang, Li Jiang, Hao Zhao, Yilun Chen

    Abstract: LiDAR-based semantic scene understanding is an important module in the modern autonomous driving perception stack. However, identifying outlier points in a LiDAR point cloud is challenging as LiDAR point clouds lack semantically-rich information. While former SOTA methods adopt heuristic architectures, we revisit this problem from the perspective of Selective Classification, which introduces a sel… ▽ More

    Submitted 18 December, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted by AAAI2025. Codes are available at https://github.com/Daniellli/LiON/

  16. arXiv:2309.10227  [pdf

    eess.IV cs.CV

    Learning Dynamic MRI Reconstruction with Convolutional Network Assisted Reconstruction Swin Transformer

    Authors: Di Xu, Hengjie Liu, Dan Ruan, Ke Sheng

    Abstract: Dynamic magnetic resonance imaging (DMRI) is an effective imaging tool for diagnosis tasks that require motion tracking of a certain anatomy. To speed up DMRI acquisition, k-space measurements are commonly undersampled along spatial or spatial-temporal domains. The difficulty of recovering useful information increases with increasing undersampling ratios. Compress sensing was invented for this pur… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: MICCAI 2023 Workshop

  17. arXiv:2304.08491  [pdf, other

    cs.CV

    Delving into Shape-aware Zero-shot Semantic Segmentation

    Authors: Xinyu Liu, Beiwen Tian, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao, Guyue Zhou

    Abstract: Thanks to the impressive progress of large-scale vision-language pretraining, recent recognition models can classify arbitrary objects in a zero-shot and open-set manner, with a surprisingly high accuracy. However, translating this success to semantic segmentation is not trivial, because this dense prediction task requires not only accurate semantic understanding but also fine shape delineation an… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR 2023, code: https://github.com/Liuxinyv/SAZS

  18. arXiv:2302.09696  [pdf

    eess.IV cs.CV

    An Efficient and Robust Method for Chest X-Ray Rib Suppression that Improves Pulmonary Abnormality Diagnosis

    Authors: Di Xu, Qifan Xu, Kevin Nhieu, Dan Ruan, Ke Sheng

    Abstract: Suppression of thoracic bone shadows on chest X-rays (CXRs) has been indicated to improve the diagnosis of pulmonary disease. Previous approaches can be categorized as unsupervised physical and supervised deep learning models. Nevertheless, with physical models able to preserve morphological details but at the cost of extremely long processing time, existing DL methods face challenges of gathering… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

  19. Generative Domain Adaptation for Face Anti-Spoofing

    Authors: Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Ran Yi, Kekai Sheng, Shouhong Ding, Lizhuang Ma

    Abstract: Face anti-spoofing (FAS) approaches based on unsupervised domain adaption (UDA) have drawn growing attention due to promising performances for target scenarios. Most existing UDA FAS methods typically fit the trained models to the target domain via aligning the distribution of semantic high-level features. However, insufficient supervision of unlabeled target domains and neglect of low-level featu… ▽ More

    Submitted 11 September, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to European Conference on Computer Vision (ECCV), 2022

  20. arXiv:2206.11134  [pdf, other

    cs.CV

    Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization

    Authors: Peixian Chen, Kekai Sheng, Mengdan Zhang, Mingbao Lin, Yunhang Shen, Shaohui Lin, Bo Ren, Ke Li

    Abstract: Open-vocabulary object detection (OVD) aims to scale up vocabulary size to detect objects of novel categories beyond the training vocabulary. Recent work resorts to the rich knowledge in pre-trained vision-language models. However, existing methods are ineffective in proposal-level vision-language alignment. Meanwhile, the models usually suffer from confidence bias toward base categories and perfo… ▽ More

    Submitted 24 November, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  21. arXiv:2206.06829  [pdf, other

    cs.CV

    Efficient Decoder-free Object Detection with Transformers

    Authors: Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen

    Abstract: Vision transformers (ViTs) are changing the landscape of object detection approaches. A natural usage of ViTs in detection is to replace the CNN-based backbone with a transformer-based backbone, which is straightforward and effective, with the price of bringing considerable computation burden for inference. More subtle usage is the DETR family, which eliminates the need for many hand-designed comp… ▽ More

    Submitted 16 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Update metadata, 10 pages

  22. arXiv:2203.12217  [pdf, other

    cs.CV

    Training-free Transformer Architecture Search

    Authors: Qinqin Zhou, Kekai Sheng, Xiawu Zheng, Ke Li, Xing Sun, Yonghong Tian, Jie Chen, Rongrong Ji

    Abstract: Recently, Vision Transformer (ViT) has achieved remarkable success in several computer vision tasks. The progresses are highly relevant to the architecture design, then it is worthwhile to propose Transformer Architecture Search (TAS) to search for better ViTs automatically. However, current TAS methods are time-consuming and existing zero-cost proxies in CNN do not generalize well to the ViT sear… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  23. arXiv:2203.10812  [pdf, other

    cs.CV

    ARM: Any-Time Super-Resolution Method

    Authors: Bohong Chen, Mingbao Lin, Kekai Sheng, Mengdan Zhang, Peixian Chen, Ke Li, Liujuan Cao, Rongrong Ji

    Abstract: This paper proposes an Any-time super-Resolution Method (ARM) to tackle the over-parameterized single image super-resolution (SISR) models. Our ARM is motivated by three observations: (1) The performance of different image patches varies with SISR networks of different sizes. (2) There is a tradeoff between computation overhead and performance of the reconstructed image. (3) Given an input image,… ▽ More

    Submitted 18 July, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted by ECCV 2022

  24. arXiv:2202.07173  [pdf, other

    eess.IV cs.CV

    To what extent can Plug-and-Play methods outperform neural networks alone in low-dose CT reconstruction

    Authors: Qifan Xu, Qihui Lyu, Dan Ruan, Ke Sheng

    Abstract: The Plug-and-Play (PnP) framework was recently introduced for low-dose CT reconstruction to leverage the interpretability and the flexibility of model-based methods to incorporate various plugins, such as trained deep learning (DL) neural networks. However, the benefits of PnP vs. state-of-the-art DL methods have not been clearly demonstrated. In this work, we proposed an improved PnP framework to… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted to IEEE ISBI 2022

  25. arXiv:2202.02457  [pdf, other

    cs.NI eess.SP

    Path Planning for the Dynamic UAV-Aided Wireless Systems using Monte Carlo Tree Search

    Authors: Yuwen Qian, Kexin Sheng, Chuan Ma, Jun Li, Ming Ding, Mahbub Hassan

    Abstract: For UAV-aided wireless systems, online path planning attracts much attention recently. To better adapt to the real-time dynamic environment, we, for the first time, propose a Monte Carlo Tree Search (MCTS)-based path planning scheme. In details, we consider a single UAV acts as a mobile server to provide computation tasks offloading services for a set of mobile users on the ground, where the movem… ▽ More

    Submitted 13 January, 2022; originally announced February 2022.

  26. arXiv:2112.10474  [pdf, other

    cs.CV

    Reciprocal Normalization for Domain Adaptation

    Authors: Zhiyong Huang, Kekai Sheng, Ke Li, Jian Liang, Taiping Yao, Weiming Dong, Dengwen Zhou, Xing Sun

    Abstract: Batch normalization (BN) is widely used in modern deep neural networks, which has been shown to represent the domain-related knowledge, and thus is ineffective for cross-domain tasks like unsupervised domain adaptation (UDA). Existing BN variant methods aggregate source and target domain knowledge in the same channel in normalization module. However, the misalignment between the features of corres… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: The best feature normalization module for domain adaptation

  27. arXiv:2108.01390  [pdf, other

    cs.CV

    Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

    Authors: Yifan Xu, Zhijie Zhang, Mengdan Zhang, Kekai Sheng, Ke Li, Weiming Dong, Liqing Zhang, Changsheng Xu, Xing Sun

    Abstract: Vision transformers (ViTs) have recently received explosive popularity, but the huge computational cost is still a severe issue. Since the computation complexity of ViT is quadratic with respect to the input sequence length, a mainstream paradigm for computation reduction is to reduce the number of tokens. Existing designs include structured spatial compression that uses a progressive shrinking py… ▽ More

    Submitted 6 December, 2021; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: We propose a novel and effective design for dynamic vision transformer to achieve better computational efficiency. The code is available at https://github.com/YifanXu74/Evo-ViT

  28. arXiv:2106.16128  [pdf, other

    cs.CV

    Dual Reweighting Domain Generalization for Face Presentation Attack Detection

    Authors: Shubao Liu, Ke-Yue Zhang, Taiping Yao, Kekai Sheng, Shouhong Ding, Ying Tai, Jilin Li, Yuan Xie, Lizhuang Ma

    Abstract: Face anti-spoofing approaches based on domain generalization (DG) have drawn growing attention due to their robustness for unseen scenarios. Previous methods treat each sample from multiple domains indiscriminately during the training process, and endeavor to extract a common feature space to improve the generalization. However, due to complex and biased data distribution, directly treating them e… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: accepted on IJCAI 2021

  29. arXiv:2105.02453  [pdf, other

    cs.CV

    Generalizable Representation Learning for Mixture Domain Face Anti-Spoofing

    Authors: Zhihong Chen, Taiping Yao, Kekai Sheng, Shouhong Ding, Ying Tai, Jilin Li, Feiyue Huang, Xinyu Jin

    Abstract: Face anti-spoofing approach based on domain generalization(DG) has drawn growing attention due to its robustness forunseen scenarios. Existing DG methods assume that the do-main label is known.However, in real-world applications, thecollected dataset always contains mixture domains, where thedomain label is unknown. In this case, most of existing meth-ods may not work. Further, even if we can obta… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in AAAI2021

  30. arXiv:2104.10376  [pdf, other

    cs.CV cs.LG

    Towards Corruption-Agnostic Robust Domain Adaptation

    Authors: Yifan Xu, Kekai Sheng, Weiming Dong, Baoyuan Wu, Changsheng Xu, Bao-Gang Hu

    Abstract: Big progress has been achieved in domain adaptation in decades. Existing works are always based on an ideal assumption that testing target domain are i.i.d. with training target domains. However, due to unpredictable corruptions (e.g., noise and blur) in real data like web images, domain adaptation methods are increasingly required to be corruption robust on target domains. In this paper, we inves… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: The first literature to investigate the topic of corruption-agnostic robust domain adaptation, a new practical and challenging domain adaptation setting

  31. arXiv:2103.13561  [pdf, other

    cs.CV

    On Evolving Attention Towards Domain Adaptation

    Authors: Kekai Sheng, Ke Li, Xiawu Zheng, Jian Liang, Weiming Dong, Feiyue Huang, Rongrong Ji, Xing Sun

    Abstract: Towards better unsupervised domain adaptation (UDA). Recently, researchers propose various domain-conditioned attention modules and make promising progresses. However, considering that the configuration of attention, i.e., the type and the position of attention module, affects the performance significantly, it is more generalized to optimize the attention configuration automatically to be speciali… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Among the first to study arbitrary domain adaptation from the perspective of network architecture design

  32. arXiv:2012.02621  [pdf, other

    cs.CV

    Effective Label Propagation for Discriminative Semi-Supervised Domain Adaptation

    Authors: Zhiyong Huang, Kekai Sheng, Weiming Dong, Xing Mei, Chongyang Ma, Feiyue Huang, Dengwen Zhou, Changsheng Xu

    Abstract: Semi-supervised domain adaptation (SSDA) methods have demonstrated great potential in large-scale image classification tasks when massive labeled data are available in the source domain but very few labeled samples are provided in the target domain. Existing solutions usually focus on feature alignment between the two domains while paying little attention to the discrimination capability of learne… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

  33. arXiv:2011.05323  [pdf, other

    cs.RO

    Robotic Exploration of Unknown 2D Environment Using a Frontier-based Automatic-Differentiable Information Gain Measure

    Authors: Di Deng, Runlin Duan, Jiahong Liu, Kuangjie Sheng, Kenji Shimada

    Abstract: At the heart of path-planning methods for autonomous robotic exploration is a heuristic which encourages exploring unknown regions of the environment. Such heuristics are typically computed using frontier-based or information-theoretic methods. Frontier-based methods define the information gain of an exploration path as the number of boundary cells, or frontiers, which are visible from the path. H… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

  34. arXiv:2010.08209  [pdf, other

    cs.CV

    Human Perception-based Evaluation Criterion for Ultra-high Resolution Cell Membrane Segmentation

    Authors: Ruohua Shi, Wenyao Wang, Zhixuan Li, Liuyuan He, Kaiwen Sheng, Lei Ma, Kai Du, Tingting Jiang, Tiejun Huang

    Abstract: Computer vision technology is widely used in biological and medical data analysis and understanding. However, there are still two major bottlenecks in the field of cell membrane segmentation, which seriously hinder further research: lack of sufficient high-quality data and lack of suitable evaluation criteria. In order to solve these two problems, this paper first proposes an Ultra-high Resolution… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: submitted to ICLR 2021

  35. arXiv:2005.10052  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Lung Segmentation from Chest X-rays using Variational Data Imputation

    Authors: Raghavendra Selvan, Erik B. Dam, Nicki S. Detlefsen, Sofus Rischel, Kaining Sheng, Mads Nielsen, Akshay Pai

    Abstract: Pulmonary opacification is the inflammation in the lungs caused by many respiratory ailments, including the novel corona virus disease 2019 (COVID-19). Chest X-rays (CXRs) with such opacifications render regions of lungs imperceptible, making it difficult to perform automated image analysis on them. In this work, we focus on segmenting lungs from such abnormal CXRs as part of a pipeline aimed at a… ▽ More

    Submitted 7 July, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted to be presented at the first Workshop on the Art of Learning with Missing Values (Artemiss) hosted by the 37th International Conference on Machine Learning (ICML). Source code, training data and the trained models are available here: https://github.com/raghavian/lungVAE/

  36. arXiv:2005.09973  [pdf, other

    cs.CV

    Dynamic Refinement Network for Oriented and Densely Packed Object Detection

    Authors: Xingjia Pan, Yuqiang Ren, Kekai Sheng, Weiming Dong, Haolei Yuan, Xiaowei Guo, Chongyang Ma, Changsheng Xu

    Abstract: Object detection has achieved remarkable progress in the past decade. However, the detection of oriented and densely packed objects remains challenging because of following inherent reasons: (1) receptive fields of neurons are all axis-aligned and of the same shape, whereas objects are usually of diverse shapes and align along various directions; (2) detection models are typically trained with gen… ▽ More

    Submitted 10 June, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted by CVPR 2020 as Oral

  37. arXiv:1911.11419  [pdf, other

    cs.CV

    Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning

    Authors: Kekai Sheng, Weiming Dong, Menglei Chai, Guohui Wang, Peng Zhou, Feiyue Huang, Bao-Gang Hu, Rongrong Ji, Chongyang Ma

    Abstract: Visual aesthetic assessment has been an active research field for decades. Although latest methods have achieved promising performance on benchmark datasets, they typically rely on a large number of manual annotations including both aesthetic labels and related image attributes. In this paper, we revisit the problem of image aesthetic assessment from the self-supervised feature learning perspectiv… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: AAAI Conference on Artificial Intelligence, 2020, accepted

    Journal ref: Proceedings of AAAI Conference on Articial Intelligence 2020

  38. arXiv:1906.01795  [pdf, other

    cs.CV

    Fully Automated Pancreas Segmentation with Two-stage 3D Convolutional Neural Networks

    Authors: Ningning Zhao, Nuo Tong, Dan Ruan, Ke Sheng

    Abstract: Due to the fact that pancreas is an abdominal organ with very large variations in shape and size, automatic and accurate pancreas segmentation can be challenging for medical image analysis. In this work, we proposed a fully automated two stage framework for pancreas segmentation based on convolutional neural networks (CNN). In the first stage, a U-Net is trained for the down-sampled 3D volume segm… ▽ More

    Submitted 25 July, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: This paper has been accepted by MICCAI 2019

  39. arXiv:1707.07089  [pdf, other

    cs.CV

    Motion Compensated Dynamic MRI Reconstruction with Local Affine Optical Flow Estimation

    Authors: Ningning Zhao, Daniel O'Connor, Adrian Basarab, Dan Ruan, Peng Hu, Ke Sheng

    Abstract: This paper proposes a novel framework to reconstruct the dynamic magnetic resonance images (DMRI) with motion compensation (MC). Due to the inherent motion effects during DMRI acquisition, reconstruction of DMRI using motion estimation/compensation (ME/MC) has been studied under a compressed sensing (CS) scheme. In this paper, by embedding the intensity-based optical flow (OF) constraint into the… ▽ More

    Submitted 13 February, 2019; v1 submitted 21 July, 2017; originally announced July 2017.