Skip to main content

Showing 1–7 of 7 results for author: Hetang, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10279  [pdf, other

    cs.CV

    EucliDreamer: Fast and High-Quality Texturing for 3D Models with Depth-Conditioned Stable Diffusion

    Authors: Cindy Le, Congrui Hetang, Chendi Lin, Ang Cao, Yihui He

    Abstract: We present EucliDreamer, a simple and effective method to generate textures for 3D models given text prompts and meshes. The texture is parametrized as an implicit function on the 3D surface, which is optimized with the Score Distillation Sampling (SDS) process and differentiable rendering. To generate high-quality textures, we leverage a depth-conditioned Stable Diffusion model guided by the dept… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Short version of arXiv:2311.15573

  2. arXiv:2403.16051  [pdf, other

    cs.CV

    Segment Anything Model for Road Network Graph Extraction

    Authors: Congrui Hetang, Haoru Xue, Cindy Le, Tianwei Yue, Wenping Wang, Yihui He

    Abstract: We propose SAM-Road, an adaptation of the Segment Anything Model (SAM) for extracting large-scale, vectorized road network graphs from satellite imagery. To predict graph geometry, we formulate it as a dense semantic segmentation task, leveraging the inherent strengths of SAM. The image encoder of SAM is fine-tuned to produce probability masks for roads and intersections, from which the graph vert… ▽ More

    Submitted 12 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR) 2024, 2nd Workshop on Scene Graphs and Graph Representation Learning

  3. arXiv:2311.15573  [pdf, other

    cs.CV cs.GR

    EucliDreamer: Fast and High-Quality Texturing for 3D Models with Stable Diffusion Depth

    Authors: Cindy Le, Congrui Hetang, Chendi Lin, Ang Cao, Yihui He

    Abstract: This paper presents a novel method to generate textures for 3D models given text prompts and 3D meshes. Additional depth information is taken into account to perform the Score Distillation Sampling (SDS) process with depth conditional Stable Diffusion. We ran our model over the open-source dataset Objaverse and conducted a user study to compare the results with those of various 3D texturing method… ▽ More

    Submitted 13 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  4. arXiv:2311.01065  [pdf, other

    cs.CV

    Novel View Synthesis from a Single RGBD Image for Indoor Scenes

    Authors: Congrui Hetang, Yuping Wang

    Abstract: In this paper, we propose an approach for synthesizing novel view images from a single RGBD (Red Green Blue-Depth) input. Novel view synthesis (NVS) is an interesting computer vision task with extensive applications. Methods using multiple images has been well-studied, exemplary ones include training scene-specific Neural Radiance Fields (NeRF), or leveraging multi-view stereo (MVS) and 3D renderi… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 2nd International Conference on Image Processing, Computer Vision and Machine Learning, November 2023

  5. Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks

    Authors: Yihui He, Jianing Qian, Jianren Wang, Cindy X. Le, Congrui Hetang, Qi Lyu, Wenping Wang, Tianwei Yue

    Abstract: Very deep convolutional neural networks (CNNs) have been firmly established as the primary methods for many computer vision tasks. However, most state-of-the-art CNNs are large, which results in high inference latency. Recently, depth-wise separable convolution has been proposed for image recognition tasks on computationally limited platforms such as robotics and self-driving cars. Though it is mu… ▽ More

    Submitted 23 September, 2023; v1 submitted 21 October, 2019; originally announced October 2019.

    Journal ref: Adv. Artif. Intell. Mach. Learn., 3 (4):1699-1719, 2023

  6. arXiv:1712.05896  [pdf, other

    cs.CV

    Impression Network for Video Object Detection

    Authors: Congrui Hetang, Hongwei Qin, Shaohui Liu, Junjie Yan

    Abstract: Video object detection is more challenging compared to image object detection. Previous works proved that applying object detector frame by frame is not only slow but also inaccurate. Visual clues get weakened by defocus and motion blur, causing failure on corresponding frames. Multi-frame feature fusion methods proved effective in improving the accuracy, but they dramatically sacrifice the speed.… ▽ More

    Submitted 15 December, 2017; originally announced December 2017.

    Comments: Tech Report

  7. arXiv:1711.08766  [pdf, other

    cs.CV

    Region-based Quality Estimation Network for Large-scale Person Re-identification

    Authors: Guanglu Song, Biao Leng, Yu Liu, Congrui Hetang, Shaofan Cai

    Abstract: One of the major restrictions on the performance of video-based person re-id is partial noise caused by occlusion, blur and illumination. Since different spatial regions of a single frame have various quality, and the quality of the same region also varies across frames in a tracklet, a good way to address the problem is to effectively aggregate complementary information from all frames in a seque… ▽ More

    Submitted 21 December, 2017; v1 submitted 23 November, 2017; originally announced November 2017.

    Comments: Accepted by AAAI 2018