
Showing 1–39 of 39 results for author: Dai, Q

Searching in archive eess.
  1. arXiv:2505.17847  [pdf, ps, other]

    cs.LG cs.AI eess.SY

    TransDF: Time-Series Forecasting Needs Transformed Label Alignment

    Authors: Hao Wang, Licheng Pan, Zhichao Chen, Xu Chen, Qingyang Dai, Lei Wang, Haoxuan Li, Zhouchen Lin

    Abstract: Training time-series forecasting models presents unique challenges in designing effective learning objectives. Existing methods predominantly utilize the temporal mean squared error, which faces two critical challenges: (1) label autocorrelation, which leads to bias from the label sequence likelihood; (2) excessive amount of tasks, which increases with the forecast horizon and complicates optimiza…

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2505.04203  [pdf, ps, other]

    cs.GR cs.SD eess.AS

    ELGAR: Expressive Cello Performance Motion Generation for Audio Rendition

    Authors: Zhiping Qiu, Yitong Jin, Yuan Wang, Yi Shi, Chongwu Wang, Chao Tan, Xiaobing Li, Feng Yu, Tao Yu, Qionghai Dai

    Abstract: The art of instrument performance stands as a vivid manifestation of human creativity and emotion. Nonetheless, generating instrument performance motions is a highly challenging task, as it requires not only capturing intricate movements but also reconstructing the complex dynamics of the performer-instrument interaction. While existing works primarily focus on modeling partial body motions, we pr…

    Submitted 1 July, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

    Journal ref: SIGGRAPH 2025

  3. arXiv:2504.19091  [pdf, other]

    eess.SP

    A Tutorial on MIMO-OFDM ISAC: From Far-Field to Near-Field

    Authors: Qianglong Dai, Yong Zeng, Huizhi Wang, Changsheng You, Chao Zhou, Hongqiang Cheng, Xiaoli Xu, Shi Jin, A. Lee Swindlehurst, Yonina C. Eldar, Robert Schober, Rui Zhang, Xiaohu You

    Abstract: Integrated sensing and communication (ISAC) is one of the key usage scenarios for future sixth-generation (6G) mobile communication networks, where communication and sensing (C&S) services are simultaneously provided through shared wireless spectrum, signal processing modules, hardware, and network infrastructure. Such an integration is strengthened by the technology trends in 6G, such as denser n…

    Submitted 26 April, 2025; originally announced April 2025.

  4. arXiv:2504.17816  [pdf, other]

    cs.CV eess.IV

    Subject-driven Video Generation via Disentangled Identity and Motion

    Authors: Daneul Kim, Jingxu Zhang, Wonjoon Jin, Sunghyun Cho, Qi Dai, Jaesik Park, Chong Luo

    Abstract: We propose to train a subject-driven customized video generation model through decoupling the subject-specific learning from temporal dynamics in zero-shot without additional tuning. A traditional method for video customization that is tuning-free often relies on large, annotated video datasets, which are computationally expensive and require extensive annotation. In contrast to the previous appro…

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: Project page: https://carpedkm.github.io/projects/disentangled_sub/index.html

  5. arXiv:2502.21036  [pdf, other]

    eess.SY

    A Demo of Radar Sensing Aided Rotatable Antenna for Wireless Communication System

    Authors: Qi Dai, Beixiong Zheng, Qiyao Wang, Xue Xiong, Xiaodan Shao, Lipeng Zhu, Rui Zhang

    Abstract: Rotatable antenna (RA) represents a novel antenna architecture that enhances wireless communication system performance by independently or collectively adjusting each antenna's boresight/orientation. In this demonstration, we develop a prototype of radar sensing-aided rotatable antenna that integrates radar sensing with dynamic antenna orientation to enhance wireless communication performance whil…

    Submitted 17 April, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

  6. arXiv:2501.15206  [pdf, other]

    physics.app-ph cond-mat.dis-nn eess.SY

    Engineering-Oriented Design of Drift-Resilient MTJ Random Number Generator via Hybrid Control Strategies

    Authors: Ran Zhang, Caihua Wan, Yingqian Xu, Xiaohan Li, Raik Hoffmann, Meike Hindenberg, Shiqiang Liu, Dehao Kong, Shilong Xiong, Shikun He, Alptekin Vardar, Qiang Dai, Junlu Gong, Yihui Sun, Zejie Zheng, Thomas Kämpfe, Guoqiang Yu, Xiufeng Han

    Abstract: Magnetic Tunnel Junctions (MTJs) have shown great promise as hardware sources for true random number generation (TRNG) due to their intrinsic stochastic switching behavior. However, practical deployment remains challenged by drift in switching probability caused by thermal fluctuations, device aging, and environmental instability. This work presents an engineering-oriented, drift-resilient MTJ-bas…

    Submitted 19 April, 2025; v1 submitted 25 January, 2025; originally announced January 2025.

    Comments: 16 pages, 9 figures, data shared at https://doi.org/10.6084/m9.figshare.28680899.v1

  7. arXiv:2501.03689  [pdf, other]

    cs.SD cs.AI eess.AS

    MAJL: A Model-Agnostic Joint Learning Framework for Music Source Separation and Pitch Estimation

    Authors: Haojie Wei, Jun Yuan, Rui Zhang, Quanyu Dai, Yueguo Chen

    Abstract: Music source separation and pitch estimation are two vital tasks in music information retrieval. Typically, the input of pitch estimation is obtained from the output of music source separation. Therefore, existing methods have tried to perform these two tasks simultaneously, so as to leverage the mutually beneficial relationship between both tasks. However, these methods still face two critical ch…

    Submitted 7 January, 2025; originally announced January 2025.

  8. arXiv:2412.20083  [pdf, other]

    cs.IT eess.SP

    Achieving Full-Bandwidth Sensing Performance with Partial Bandwidth Allocation for ISAC

    Authors: Zhiqiang Xiao, Zhiwen Zhou, Qianglong Dai, Yong Zeng, Fei Yang, Yan Chen

    Abstract: This letter studies an uplink integrated sensing and communication (ISAC) system using discrete Fourier transform spread orthogonal frequency division multiplexing (DFT-s-OFDM) transmission. We try to answer the following fundamental question: With only a fractional bandwidth allocated to the user with sensing task, can the same delay resolution and unambiguous range be achieved as if all bandwidt…

    Submitted 28 December, 2024; originally announced December 2024.

  9. arXiv:2409.19835  [pdf, other]

    cs.CV eess.IV

    MoCoLSK: Modality Conditioned High-Resolution Downscaling for Land Surface Temperature

    Authors: Qun Dai, Chunyang Yuan, Yimian Dai, Yuxuan Li, Xiang Li, Kang Ni, Jianhui Xu, Xiangbo Shu, Jian Yang

    Abstract: Land Surface Temperature (LST) is a critical parameter for environmental studies, but directly obtaining high spatial resolution LST data remains challenging due to the spatio-temporal trade-off in satellite remote sensing. Guided LST downscaling has emerged as an alternative solution to overcome these limitations, but current methods often neglect spatial non-stationarity, and there is a lack of…

    Submitted 2 March, 2025; v1 submitted 29 September, 2024; originally announced September 2024.

    Comments: Accepted by IEEE TGRS

  10. A Deep Learning-Augmented Stand-off Radar Scheme for Rapidly Detecting Tree Defects

    Authors: Jiwei Qian, Yee Hui Lee, Kaixuan Cheng, Qiqi Dai, Mohamed Lokman Mohd Yusof, Daryl Lee, Abdulkadir C. Yucel

    Abstract: Tree defect detection is crucial for the structural health screening of trees. Existing nondestructive testing (NDT) techniques for tree defect detection require time-consuming and labor-intensive measurement campaigns. This discourages their application for the routine structural health screening of whole populations of managed urban trees. To address this issue, this study proposes a deep-learni…

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: Accepted and to be published in IEEE Transactions on Geoscience and Remote Sensing

  11. arXiv:2405.16850  [pdf, other]

    eess.IV cs.CV cs.LG

    UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation

    Authors: Runzhao Yang, Yinda Chen, Zhihong Zhang, Xiaoyu Liu, Zongren Li, Kunlun He, Zhiwei Xiong, Jinli Suo, Qionghai Dai

    Abstract: In the field of medical image compression, Implicit Neural Representation (INR) networks have shown remarkable versatility due to their flexible compression ratios, yet they are constrained by a one-to-one fitting approach that results in lengthy encoding times. Our novel method, "UniCompress", innovatively extends the compression capabilities of INR by being the first to compress multi…

    Submitted 27 May, 2024; originally announced May 2024.

  12. arXiv:2404.07551  [pdf, other]

    eess.IV cs.CV

    Event-Enhanced Snapshot Compressive Videography at 10K FPS

    Authors: Bo Zhang, Jinli Suo, Qionghai Dai

    Abstract: Video snapshot compressive imaging (SCI) encodes the target dynamic scene compactly into a snapshot and reconstructs its high-speed frame sequence afterward, greatly reducing the required data footprint and transmission bandwidth as well as enabling high-speed imaging with a low frame rate intensity camera. In implementation, high-speed dynamics are encoded via temporally varying patterns, and onl…

    Submitted 11 April, 2024; originally announced April 2024.

  13. arXiv:2403.15853  [pdf]

    eess.IV cs.CV

    An edge detection-based deep learning approach for tear meniscus height measurement

    Authors: Kesheng Wang, Kunhui Xu, Xiaoyu Chen, Chunlei He, Jianfeng Zhang, Dexing Kong, Qi Dai, Shoujun Huang

    Abstract: Automatic measurements of tear meniscus height (TMH) have been achieved by using deep learning techniques; however, annotation is significantly influenced by subjective factors and is both time-consuming and labor-intensive. In this paper, we introduce an automatic TMH measurement technique based on edge detection-assisted annotation within a deep learning framework. This method generates mask lab…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 22 pages, 5 figures

  14. arXiv:2311.13134  [pdf, other]

    cs.CV eess.IV

    Lightweight High-Speed Photography Built on Coded Exposure and Implicit Neural Representation of Videos

    Authors: Zhihong Zhang, Runzhao Yang, Jinli Suo, Yuxiao Cheng, Qionghai Dai

    Abstract: The demand for compact cameras capable of recording high-speed scenes with high resolution is steadily increasing. However, achieving such capabilities often entails high bandwidth requirements, resulting in bulky, heavy systems unsuitable for low-capacity platforms. To address this challenge, leveraging a coded exposure setup to encode a frame sequence into a blurry snapshot and subsequently retr…

    Submitted 28 August, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: Accepted by IJCV

  15. arXiv:2310.01861  [pdf, other]

    eess.IV cs.CV cs.GR

    Shifting More Attention to Breast Lesion Segmentation in Ultrasound Videos

    Authors: Junhao Lin, Qian Dai, Lei Zhu, Huazhu Fu, Qiong Wang, Weibin Li, Wenhao Rao, Xiaoyang Huang, Liansheng Wang

    Abstract: Breast lesion segmentation in ultrasound (US) videos is essential for diagnosing and treating axillary lymph node metastasis. However, the lack of a well-established and large-scale ultrasound video dataset with high-quality annotations has posed a persistent challenge for the research community. To overcome this issue, we meticulously curated a US video breast lesion segmentation dataset comprisi…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 10 pages

  16. 3DInvNet: A Deep Learning-Based 3D Ground-Penetrating Radar Data Inversion

    Authors: Qiqi Dai, Yee Hui Lee, Hai-Han Sun, Genevieve Ow, Mohamed Lokman Mohd Yusof, Abdulkadir C. Yucel

    Abstract: The reconstruction of the 3D permittivity map from ground-penetrating radar (GPR) data is of great importance for mapping subsurface environments and inspecting underground structural integrity. Traditional iterative 3D reconstruction algorithms suffer from strong non-linearity, ill-posedness, and high computational cost. To tackle these issues, a 3D deep learning scheme, called 3DInvNet, is propo…

    Submitted 9 May, 2023; originally announced May 2023.

  17. arXiv:2301.10167  [pdf, other]

    eess.SP cs.LG physics.optics

    EEG Opto-processor: epileptic seizure detection using diffractive photonic computing units

    Authors: Tao Yan, Maoqi Zhang, Sen Wan, Kaifeng Shang, Haiou Zhang, Xun Cao, Xing Lin, Qionghai Dai

    Abstract: Electroencephalography (EEG) analysis extracts critical information from brain signals, which has provided fundamental support for various applications, including brain-disease diagnosis and brain-computer interface. However, the real-time processing of large-scale EEG signals at high energy efficiency has placed great challenges for electronic processors on edge computing devices. Here, we propos…

    Submitted 9 December, 2022; originally announced January 2023.

    Comments: 22 pages, 5 figures

  18. arXiv:2209.15180  [pdf, other]

    eess.IV cs.CV

    SCI: A Spectrum Concentrated Implicit Neural Compression for Biomedical Data

    Authors: Runzhao Yang, Tingxiong Xiao, Yuxiao Cheng, Qianni Cao, Jinyuan Qu, Jinli Suo, Qionghai Dai

    Abstract: Massive collection and explosive growth of biomedical data demand effective compression for efficient storage, transmission and sharing. Readily available visual data compression techniques have been studied extensively but tailored for natural images/videos, and thus show limited performance on biomedical data which are of different features and larger diversity. Emerging implicit neural repres…

    Submitted 23 November, 2022; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: accepted to AAAI2023

    ACM Class: I.4.2; I.2.10

  19. INFWIDE: Image and Feature Space Wiener Deconvolution Network for Non-blind Image Deblurring in Low-Light Conditions

    Authors: Zhihong Zhang, Yuxiao Cheng, Jinli Suo, Liheng Bian, Qionghai Dai

    Abstract: Under low-light environment, handheld photography suffers from severe camera shake under long exposure settings. Although existing deblurring algorithms have shown promising performance on well-exposed blurry images, they still cannot cope with low-light snapshots. Sophisticated noise and saturation regions are two dominating challenges in practical low-light deblurring. In this work, we propose a…

    Submitted 17 February, 2023; v1 submitted 17 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE Trans. Image Process, early access version available at https://ieeexplore.ieee.org/document/10047966

  20. A Deep Learning-Based GPR Forward Solver for Predicting B-Scans of Subsurface Objects

    Authors: Qiqi Dai, Yee Hui Lee, Hai-Han Sun, Jiwei Qian, Genevieve Ow, Mohamed Lokman Mohd Yusof, Abdulkadir C. Yucel

    Abstract: The forward full-wave modeling of ground-penetrating radar (GPR) facilitates the understanding and interpretation of GPR data. Traditional forward solvers require excessive computational resources, especially when their repetitive executions are needed in signal processing and/or machine learning algorithms for GPR data inversion. To alleviate the computational burden, a deep learning-based 2D GPR…

    Submitted 13 July, 2022; originally announced July 2022.

  21. DMRF-UNet: A Two-Stage Deep Learning Scheme for GPR Data Inversion under Heterogeneous Soil Conditions

    Authors: Qiqi Dai, Yee Hui Lee, Hai-Han Sun, Genevieve Ow, Mohamed Lokman Mohd Yusof, Abdulkadir C. Yucel

    Abstract: Traditional ground-penetrating radar (GPR) data inversion leverages iterative algorithms which suffer from high computation costs and low accuracy when applied to complex subsurface scenarios. Existing deep learning-based methods focus on the ideal homogeneous subsurface environments and ignore the interference due to clutters and noise in real-world heterogeneous environments. To address these is…

    Submitted 16 May, 2022; originally announced May 2022.

  22. A Dual Sensor Computational Camera for High Quality Dark Videography

    Authors: Yuxiao Cheng, Runzhao Yang, Zhihong Zhang, Jinli Suo, Qionghai Dai

    Abstract: Videos captured under low light conditions suffer from severe noise. A variety of efforts have been devoted to image/video noise suppression and made large progress. However, in extremely dark scenarios, extensive photon starvation would hamper precise noise modeling. Instead, developing an imaging system collecting more photons is a more effective way for high-quality video capture under low illu…

    Submitted 11 April, 2022; originally announced April 2022.

    Journal ref: Information Fusion Volume 93, May 2023, Pages 429-440

  23. Estimating Parameters of the Tree Root in Heterogeneous Soil Environments via Mask-Guided Multi-Polarimetric Integration Neural Network

    Authors: Hai-Han Sun, Yee Hui Lee, Qiqi Dai, Chongyi Li, Genevieve Ow, Mohamed Lokman Mohd Yusof, Abdulkadir C. Yucel

    Abstract: Ground-penetrating radar (GPR) has been used as a non-destructive tool for tree root inspection. Estimating root-related parameters from GPR radargrams greatly facilitates root health monitoring and imaging. However, the task of estimating root-related parameters is challenging as the root reflection is a complex function of multiple root parameters and root orientations. Existing methods can only…

    Submitted 26 December, 2021; originally announced December 2021.

    Comments: 14 pages, 12 figures

  24. arXiv:2111.09103  [pdf, other]

    eess.IV cs.CV

    Fast and Light-Weight Network for Single Frame Structured Illumination Microscopy Super-Resolution

    Authors: Xi Cheng, Jun Li, Qiang Dai, Zhenyong Fu, Jian Yang

    Abstract: Structured illumination microscopy (SIM) is an important super-resolution based microscopy technique that breaks the diffraction limit and enhances optical microscopy systems. With the development of biology and medical engineering, there is a high demand for real-time and robust SIM imaging under extreme low light and short exposure environments. Existing SIM techniques typically require multiple…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 9 pages

  25. arXiv:2109.08880  [pdf, other]

    cs.CV cs.AI eess.IV

    Computational Imaging and Artificial Intelligence: The Next Revolution of Mobile Vision

    Authors: Jinli Suo, Weihang Zhang, Jin Gong, Xin Yuan, David J. Brady, Qionghai Dai

    Abstract: Signal capture stands in the forefront to perceive and understand the environment and thus imaging plays the pivotal role in mobile vision. Recent explosive progresses in Artificial Intelligence (AI) have shown great potential to develop advanced mobile platforms with new imaging devices. Traditional imaging systems based on the "capturing images first and processing afterwards" mechanism cannot m…

    Submitted 18 September, 2021; originally announced September 2021.

  26. arXiv:2107.01422  [pdf, other]

    physics.optics cs.CV eess.IV q-bio.TO

    Imaging dynamics beneath turbid media via parallelized single-photon detection

    Authors: Shiqi Xu, Xi Yang, Wenhui Liu, Joakim Jonsson, Ruobing Qian, Pavan Chandra Konda, Kevin C. Zhou, Lucas Kreiss, Qionghai Dai, Haoqian Wang, Edouard Berrocal, Roarke Horstmeyer

    Abstract: Noninvasive optical imaging through dynamic scattering media has numerous important biomedical applications but still remains a challenging task. While standard diffuse imaging methods measure optical absorption or fluorescent emission, it is also well-established that the temporal correlation of scattered coherent light diffuses through tissue much like optical intensity. Few works to date, howev…

    Submitted 12 June, 2022; v1 submitted 3 July, 2021; originally announced July 2021.

  27. arXiv:2106.15765  [pdf, other]

    eess.IV cs.CV physics.optics

    10-mega pixel snapshot compressive imaging with a hybrid coded aperture

    Authors: Zhihong Zhang, Chao Deng, Yang Liu, Xin Yuan, Jinli Suo, Qionghai Dai

    Abstract: High resolution images are widely used in our daily life, whereas high-speed video capture is challenging due to the low frame rate of cameras working at the high resolution mode. Digging deeper, the main bottleneck lies in the low throughput of existing imaging systems. Towards this end, snapshot compressive imaging (SCI) was proposed as a promising solution to improve the throughput of imaging s…

    Submitted 15 August, 2021; v1 submitted 29 June, 2021; originally announced June 2021.

    Comments: 11 pages, 8 figures, accepted by Photonics Research

  28. arXiv:2106.00682  [pdf]

    physics.med-ph eess.IV

    Prostate cancer histopathology with label-free multispectral deep UV microscopy quantifies phenotypes of tumor grade and aggressiveness

    Authors: Soheil Soltani, Ashkan Ojaghi, Hui Qiao, Nischita Kaza, Xinyang Li, Qionghai Dai, Adeboye O Osunkoya, Francisco E Robles

    Abstract: Identifying prostate cancer patients that are harboring aggressive forms of prostate cancer remains a significant clinical challenge. To shed light on this problem, we develop an approach based on multispectral deep-ultraviolet (UV) microscopy that provides novel quantitative insight into the aggressiveness and grade of this disease. First, we find that UV spectral signatures from endogenous molec…

    Submitted 1 June, 2021; originally announced June 2021.

  29. arXiv:2104.03078  [pdf, other]

    eess.IV cs.CV

    Universal and Flexible Optical Aberration Correction Using Deep-Prior Based Deconvolution

    Authors: Xiu Li, Jinli Suo, Weihang Zhang, Xin Yuan, Qionghai Dai

    Abstract: High quality imaging usually requires bulky and expensive lenses to compensate geometric and chromatic aberrations. This poses high constraints on the optical hash or low cost applications. Although one can utilize algorithmic reconstruction to remove the artifacts of low-end lenses, the degeneration from optical aberrations is spatially varying and the computation has to trade off efficiency for…

    Submitted 18 August, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: ICCV2021

  30. Light Field Reconstruction Using Convolutional Network on EPI and Extended Applications

    Authors: Gaochang Wu, Yebin Liu, Lu Fang, Qionghai Dai, Tianyou Chai

    Abstract: In this paper, a novel convolutional neural network (CNN)-based framework is developed for light field reconstruction from a sparse set of views. We indicate that the reconstruction can be efficiently modeled as angular restoration on an epipolar plane image (EPI). The main problem in direct reconstruction on the EPI involves an information asymmetry between the spatial and angular dimensions, whe…

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Published in IEEE TPAMI, 2019

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019

  31. arXiv:2103.05843  [pdf, other]

    eess.IV cs.CV cs.MM

    Learning to Estimate Kernel Scale and Orientation of Defocus Blur with Asymmetric Coded Aperture

    Authors: Jisheng Li, Qi Dai, Jiangtao Wen

    Abstract: Consistent in-focus input imagery is an essential precondition for machine vision systems to perceive the dynamic environment. A defocus blur severely degrades the performance of vision systems. To tackle this problem, we propose a deep-learning-based framework estimating the kernel scale and orientation of the defocus blur to adjust lens focus rapidly. Our pipeline utilizes 3D ConvNet for a varia…

    Submitted 9 March, 2021; originally announced March 2021.

  32. arXiv:2101.04822  [pdf, other]

    eess.IV cs.CV

    Plug-and-Play Algorithms for Video Snapshot Compressive Imaging

    Authors: Xin Yuan, Yang Liu, Jinli Suo, Frédo Durand, Qionghai Dai

    Abstract: We consider the reconstruction problem of video snapshot compressive imaging (SCI), which captures high-speed videos using a low-speed 2D sensor (detector). The underlying principle of SCI is to modulate sequential high-speed frames with different masks and then these encoded frames are integrated into a snapshot on the sensor and thus the sensor can be of low-speed. On one hand, video SCI enjoys…

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 18 pages, 12 figures and 4 tables. Journal extension of arXiv:2003.13654. Code available at https://github.com/liuyang12/PnP-SCI_python

  33. arXiv:2008.11659  [pdf]

    eess.IV cs.LG cs.NE physics.optics

    Large-scale neuromorphic optoelectronic computing with a reconfigurable diffractive processing unit

    Authors: Tiankuang Zhou, Xing Lin, Jiamin Wu, Yitong Chen, Hao Xie, Yipeng Li, Jintao Fan, Huaqiang Wu, Lu Fang, Qionghai Dai

    Abstract: Application-specific optical processors have been considered disruptive technologies for modern computing that can fundamentally accelerate the development of artificial intelligence (AI) by offering substantially improved computing performance. Recent advancements in optical neural network architectures for neural information processing have been applied to perform various machine learning tasks.…

    Submitted 26 August, 2020; originally announced August 2020.

  34. arXiv:2005.12690  [pdf, other]

    cs.CV cs.LG eess.IV

    SurfaceNet+: An End-to-end 3D Neural Network for Very Sparse Multi-view Stereopsis

    Authors: Mengqi Ji, Jinzhi Zhang, Qionghai Dai, Lu Fang

    Abstract: Multi-view stereopsis (MVS) tries to recover the 3D model from 2D images. As the observations become sparser, the significant 3D information loss makes the MVS problem more challenging. Instead of only focusing on densely sampled conditions, we investigate sparse-MVS with large baseline angles since the sparser sensation is more practical and more cost-efficient. By investigating various observati…

    Submitted 26 May, 2020; originally announced May 2020.

    Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), May 2020

    Journal ref: 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  35. arXiv:2005.12597  [pdf, other]

    eess.IV cs.CV

    Perceptual Extreme Super Resolution Network with Receptive Field Block

    Authors: Taizhang Shang, Qiuju Dai, Shengchen Zhu, Tong Yang, Yandong Guo

    Abstract: Perceptual Extreme Super-Resolution for single image is extremely difficult, because the texture details of different images vary greatly. To tackle this difficulty, we develop a super resolution network with receptive field block based on Enhanced SRGAN. We call our network RFB-ESRGAN. The key contributions are listed as follows. First, for the purpose of extracting multi-scale information and en…

    Submitted 26 May, 2020; originally announced May 2020.

    Comments: CVPRW 2020 accepted oral, 8 pages, 45 figures

  36. arXiv:2005.01056  [pdf, other]

    eess.IV cs.CV

    NTIRE 2020 Challenge on Perceptual Extreme Super-Resolution: Methods and Results

    Authors: Kai Zhang, Shuhang Gu, Radu Timofte, Taizhang Shang, Qiuju Dai, Shengchen Zhu, Tong Yang, Yandong Guo, Younghyun Jo, Sejong Yang, Seon Joo Kim, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Jing Liu, Kwangjin Yoon, Taegyun Jeon, Kazutoshi Akita, Takeru Ooba, Norimichi Ukita, Zhipeng Luo, Yuehan Yao, Zhenyu Xu, Dongliang He, et al. (38 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 challenge on perceptual extreme super-resolution with focus on proposed solutions and results. The challenge task was to super-resolve an input image with a magnification factor 16 based on a set of prior examples of low and corresponding high resolution images. The goal is to obtain a network design capable to produce high resolution results with the best percept…

    Submitted 3 May, 2020; originally announced May 2020.

    Comments: CVPRW 2020

  37. arXiv:2003.14237  [pdf]

    eess.IV physics.optics

    Single-pixel coherent diffraction imaging

    Authors: Meng Li, Liheng Bian, Guoan Zheng, Andrew Maiden, Yang Liu, Yiming Li, Qionghai Dai, Jun Zhang

    Abstract: Complex-field imaging is indispensable for numerous applications at wavelengths from X-ray to THz, with amplitude describing transmittance (or reflectivity) and phase revealing intrinsic structure of the target object. Coherent diffraction imaging (CDI) employs iterative phase retrieval algorithms to process diffraction measurements and is the predominant non-interferometric method to image comple…

    Submitted 29 March, 2020; originally announced March 2020.

  38. arXiv:2003.13654  [pdf, other]

    eess.IV cs.CV

    Plug-and-Play Algorithms for Large-scale Snapshot Compressive Imaging

    Authors: Xin Yuan, Yang Liu, Jinli Suo, Qionghai Dai

    Abstract: Snapshot compressive imaging (SCI) aims to capture the high-dimensional (usually 3D) images using a 2D sensor (detector) in a single snapshot. Though enjoying the advantages of low-bandwidth, low-power and low-cost, applying SCI to large-scale problems (HD or UHD videos) in our daily life is still challenging. The bottleneck lies in the reconstruction algorithms; they are either too slow (iterativ…

    Submitted 17 July, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: CVPR 2020. Corrected a proof of convergence in previous version

  39. arXiv:1811.03455  [pdf, other]

    eess.IV

    High fidelity single-pixel imaging

    Authors: Chao Deng, Xuemei Hu, Xiaoxu Li, Jinli Suo, Zhili Zhang, Qionghai Dai

    Abstract: Single-pixel imaging (SPI) is an emerging technique which has attracted wide attention in various research fields. However, restricted by the low reconstruction quality and large amount of measurements, the practical application is still in its infancy. Inspired by the fact that natural scenes exhibit unique degenerate structures in the low dimensional subspace, we propose to take advantage of the…

    Submitted 7 November, 2018; originally announced November 2018.

    Comments: 5 pages, 6 figures