Skip to main content

Showing 1–10 of 10 results for author: Ren, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.01212  [pdf, other

    cs.CV eess.IV

    High Dynamic Range Novel View Synthesis with Single Exposure

    Authors: Kaixuan Zhang, Hu Wang, Minxian Li, Mingwu Ren, Mao Ye, Xiatian Zhu

    Abstract: High Dynamic Range Novel View Synthesis (HDR-NVS) aims to establish a 3D scene HDR model from Low Dynamic Range (LDR) imagery. Typically, multiple-exposure LDR images are employed to capture a wider range of brightness levels in a scene, as a single LDR image cannot represent both the brightest and darkest regions simultaneously. While effective, this multiple-exposure HDR-NVS approach has signifi… ▽ More

    Submitted 19 May, 2025; v1 submitted 2 May, 2025; originally announced May 2025.

    Comments: It has been accepted by ICML 2025

  2. arXiv:2409.12311  [pdf, other

    cs.RO eess.SY

    Towards Closing the Loop in Robotic Pollination for Indoor Farming via Autonomous Microscopic Inspection

    Authors: Chuizheng Kong, Alex Qiu, Idris Wibowo, Marvin Ren, Aishik Dhori, Kai-Shu Ling, Ai-Ping Hu, Shreyas Kousik

    Abstract: Effective pollination is a key challenge for indoor farming, since bees struggle to navigate without the sun. While a variety of robotic system solutions have been proposed, it remains difficult to autonomously check that a flower has been sufficiently pollinated to produce high-quality fruit, which is especially critical for self-pollinating crops such as strawberries. To this end, this work prop… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

  3. arXiv:2405.03178  [pdf, other

    cs.SD eess.AS

    POPDG: Popular 3D Dance Generation with PopDanceSet

    Authors: Zhenye Luo, Min Ren, Xuecai Hu, Yongzhen Huang, Li Yao

    Abstract: Generating dances that are both lifelike and well-aligned with music continues to be a challenging task in the cross-modal domain. This paper introduces PopDanceSet, the first dataset tailored to the preferences of young audiences, enabling the generation of aesthetically oriented dances. And it surpasses the AIST++ dataset in music genre diversity and the intricacy and depth of dance movements. M… ▽ More

    Submitted 27 December, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  4. arXiv:2404.04904  [pdf, other

    cs.SD cs.AI eess.AS

    Cross-Domain Audio Deepfake Detection: Dataset and Analysis

    Authors: Yuang Li, Min Zhang, Mengxin Ren, Miaomiao Ma, Daimeng Wei, Hao Yang

    Abstract: Audio deepfake detection (ADD) is essential for preventing the misuse of synthetic voices that may infringe on personal rights and privacy. Recent zero-shot text-to-speech (TTS) models pose higher risks as they can clone voices with a single utterance. However, the existing ADD datasets are outdated, leading to suboptimal generalization of detection models. In this paper, we construct a new cross-… ▽ More

    Submitted 20 September, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

  5. arXiv:2301.10624  [pdf, ps, other

    cs.IT eess.SP

    Energy-Delay Tradeoff in Helper-Assisted NOMA-MEC Systems: A Four-Sided Matching Algorithm

    Authors: Mengmeng Ren, Long Yang, Hai Jiang, Jian Chen, Yuchen Zhou

    Abstract: This paper designs a helper-assisted resource allocation strategy in non-orthogonal multiple access (NOMA)-enabled mobile edge computing (MEC) systems, in order to guarantee the quality of service (QoS) of the energy/delay-sensitive user equipments (UEs). To achieve a tradeoff between the energy consumption and the delay, we introduce a novel performance metric, called \emph{energy-delay tradeoff}… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

  6. arXiv:2106.13188  [pdf, other

    eess.IV cs.CV cs.LG

    Q-space Conditioned Translation Networks for Directional Synthesis of Diffusion Weighted Images from Multi-modal Structural MRI

    Authors: Mengwei Ren, Heejong Kim, Neel Dey, Guido Gerig

    Abstract: Current deep learning approaches for diffusion MRI modeling circumvent the need for densely-sampled diffusion-weighted images (DWIs) by directly predicting microstructural indices from sparsely-sampled DWIs. However, they implicitly make unrealistic assumptions of static $q$-space sampling during training and reconstruction. Further, such approaches can restrict downstream usage of variably sample… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: Accepted by MICCAI 2021. Project page: https://heejongkim.com/dwi-synthesis; Code: https://github.com/mengweiren/q-space-conditioned-dwi-synthesis

  7. arXiv:2105.04349  [pdf, other

    cs.CV cs.LG eess.IV

    Generative Adversarial Registration for Improved Conditional Deformable Templates

    Authors: Neel Dey, Mengwei Ren, Adrian V. Dalca, Guido Gerig

    Abstract: Deformable templates are essential to large-scale medical image registration, segmentation, and population analysis. Current conventional and deep network-based methods for template construction use only regularized registration objectives and often yield templates with blurry and/or anatomically implausible appearance, confounding downstream biomedical interpretation. We reformulate deformable re… ▽ More

    Submitted 17 March, 2022; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: ICCV 2021 camera-ready. 24 pages, 15 figures. Project page: https://www.neeldey.com/deformable-templates/ Code: https://github.com/neel-dey/Atlas-GAN

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision 2021

  8. Segmentation-Renormalized Deep Feature Modulation for Unpaired Image Harmonization

    Authors: Mengwei Ren, Neel Dey, James Fishbaugh, Guido Gerig

    Abstract: Deep networks are now ubiquitous in large-scale multi-center imaging studies. However, the direct aggregation of images across sites is contraindicated for downstream statistical and deep learning-based image analysis due to inconsistent contrast, resolution, and noise. To this end, in the absence of paired data, variations of Cycle-consistent Generative Adversarial Networks have been used to harm… ▽ More

    Submitted 15 February, 2021; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: Accepted by IEEE Transactions on Medical Imaging. Code available at https://github.com/mengweiren/segmentation-renormalized-harmonization

  9. arXiv:2009.00294  [pdf, other

    eess.IV cs.CV

    Recognition Oriented Iris Image Quality Assessment in the Feature Space

    Authors: Leyuan Wang, Kunbo Zhang, Min Ren, Yunlong Wang, Zhenan Sun

    Abstract: A large portion of iris images captured in real world scenarios are poor quality due to the uncontrolled environment and the non-cooperative subject. To ensure that the recognition algorithm is not affected by low-quality images, traditional hand-crafted factors based methods discard most images, which will cause system timeout and disrupt user experience. In this paper, we propose a recognition-o… ▽ More

    Submitted 27 September, 2020; v1 submitted 1 September, 2020; originally announced September 2020.

  10. arXiv:1512.00399  [pdf, ps, other

    eess.SY

    Joint Group Testing of Time-varying Faulty Sensors and System State Estimation in Large Sensor Networks

    Authors: Mengqi Ren, Ruixin Niu

    Abstract: The problem of faulty sensor detection is investigated in large sensor networks where the sensor faults are sparse and time-varying, such as those caused by attacks launched by an adversary. Group testing and the Kalman filter are designed jointly to perform real time system state estimation and time-varying faulty sensor detection with a small number of tests. Numerical results show that the faul… ▽ More

    Submitted 1 December, 2015; originally announced December 2015.

    Comments: 5 pages, 3 figures, and 2 tables