Showing 1–9 of 9 results for author: Martin, R R

  1. arXiv:2403.09143  [pdf, other]

    cs.GR

    A New Split Algorithm for 3D Gaussian Splatting

    Authors: Qiyuan Feng, Gengchen Cao, Haoxiang Chen, Tai-Jiang Mu, Ralph R. Martin, Shi-Min Hu

    Abstract: 3D Gaussian splatting models, as a novel explicit 3D representation, have been applied in many domains recently, such as explicit geometric editing and geometry generation. Progress has been rapid. However, due to their mixed scales and cluttered shapes, 3D Gaussian splatting models can produce a blurred or needle-like effect near the surface. At the same time, 3D Gaussian splatting models tend to…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 11 pages, 10 figures

  2. arXiv:2301.06962  [pdf, other]

    cs.CV

    Long Range Pooling for 3D Large-Scale Scene Understanding

    Authors: Xiang-Li Li, Meng-Hao Guo, Tai-Jiang Mu, Ralph R. Martin, Shi-Min Hu

    Abstract: Inspired by the success of recent vision transformers and large kernel design in convolutional neural networks (CNNs), in this paper, we analyze and explore essential reasons for their success. We claim two factors that are critical for 3D large-scale scene understanding: a larger receptive field and operations with greater non-linearity. The former is responsible for providing long range contexts…

    Submitted 17 January, 2023; originally announced January 2023.

  3. Attention Mechanisms in Computer Vision: A Survey

    Authors: Meng-Hao Guo, Tian-Xing Xu, Jiang-Jiang Liu, Zheng-Ning Liu, Peng-Tao Jiang, Tai-Jiang Mu, Song-Hai Zhang, Ralph R. Martin, Ming-Ming Cheng, Shi-Min Hu

    Abstract: Humans can naturally and effectively find salient regions in complex scenes. Motivated by this observation, attention mechanisms were introduced into computer vision with the aim of imitating this aspect of the human visual system. Such an attention mechanism can be regarded as a dynamic weight adjustment process based on features of the input image. Attention mechanisms have achieved great succes…

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 27 pages, 9 figures

    Journal ref: Computational Visual Media, 2022, Vol. 8, No. 3, 331-368

  4. arXiv:2111.03420  [pdf, other]

    cs.CV

    Sampling Equivariant Self-attention Networks for Object Detection in Aerial Images

    Authors: Guo-Ye Yang, Xiang-Li Li, Ralph R. Martin, Shi-Min Hu

    Abstract: Objects in aerial images have greater variations in scale and orientation than in typical images, so detection is more difficult. Convolutional neural networks use a variety of frequency- and orientation-specific kernels to identify objects subject to different transformations; these require many parameters. Sampling equivariant networks can adjust sampling from input feature maps according to the…

    Submitted 5 November, 2021; originally announced November 2021.

  5. arXiv:2106.02285  [pdf, other]

    cs.CV cs.GR cs.LG

    Subdivision-Based Mesh Convolution Networks

    Authors: Shi-Min Hu, Zheng-Ning Liu, Meng-Hao Guo, Jun-Xiong Cai, Jiahui Huang, Tai-Jiang Mu, Ralph R. Martin

    Abstract: Convolutional neural networks (CNNs) have made great breakthroughs in 2D computer vision. However, the irregular structure of meshes makes it hard to harness the potential of CNNs directly on them. A subdivision surface provides a hierarchical multi-resolution structure, in which each face in a closed 2-manifold triangle mesh is exactly adjacent to three faces. Motivated by these two observations, this…

    Submitted 29 December, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Codes are available in https://github.com/lzhengning/SubdivNet

    ACM Class: I.3.5

    Journal ref: ACM Transactions on Graphics, Volume 41, Issue 3, 2022, Article No.: 25, pp 1-16

  6. arXiv:2105.15078  [pdf, other]

    cs.CV cs.LG

    Can Attention Enable MLPs To Catch Up With CNNs?

    Authors: Meng-Hao Guo, Zheng-Ning Liu, Tai-Jiang Mu, Dun Liang, Ralph R. Martin, Shi-Min Hu

    Abstract: In the first week of May, 2021, researchers from four different institutions: Google, Tsinghua University, Oxford University and Facebook, shared their latest work [16, 7, 12, 17] on arXiv.org almost at the same time, each proposing new learning architectures, consisting mainly of linear layers, claiming them to be comparable, or even superior to convolutional-based models. This sparked immediate…

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: Computational Visual Media, 2021, accepted. 4 pages, 1 figure

  7. PCT: Point cloud transformer

    Authors: Meng-Hao Guo, Jun-Xiong Cai, Zheng-Ning Liu, Tai-Jiang Mu, Ralph R. Martin, Shi-Min Hu

    Abstract: The irregular domain and lack of ordering make it challenging to design deep neural networks for point cloud processing. This paper presents a novel framework named Point Cloud Transformer (PCT) for point cloud learning. PCT is based on Transformer, which achieves huge success in natural language processing and displays great potential in image processing. It is inherently permutation invariant for…

    Submitted 6 June, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: 11 pages, 5 figures

    Journal ref: Computational Visual Media, 2021, Vol. 7, No. 2, Pages: 187 - 199

  8. arXiv:2003.08763  [pdf]

    cs.CV cs.IR cs.LG stat.ML

    Shape retrieval of non-rigid 3d human models

    Authors: David Pickup, Xianfang Sun, Paul L Rosin, Ralph R Martin, Z Cheng, Zhouhui Lian, Masaki Aono, A Ben Hamza, A Bronstein, M Bronstein, S Bu, Umberto Castellani, S Cheng, Valeria Garro, Andrea Giachetti, Afzal Godil, Luca Isaia, J Han, Henry Johan, L Lai, Bo Li, C Li, Haisheng Li, Roee Litman, X Liu, et al. (6 additional authors not shown)

    Abstract: 3D models of humans are commonly used within computer graphics and vision, and so the ability to distinguish between body shapes is an important shape retrieval problem. We extend our recent paper which provided a benchmark for testing non-rigid 3D shape retrieval algorithms on 3D human models. This benchmark provided a far stricter challenge than previous shape benchmarks. We have added 145 new m…

    Submitted 1 March, 2020; originally announced March 2020.

    Comments: International Journal of Computer Vision, 2016

  9. On difference graphs and the local dimension of posets

    Authors: Jinha Kim, Ryan R. Martin, Tomáš Masařík, Warren Shull, Heather C. Smith, Andrew Uzzell, Zhiyu Wang

    Abstract: The dimension of a partially-ordered set (poset), introduced by Dushnik and Miller (1941), has been studied extensively in the literature. Recently, Ueckerdt (2016) proposed a variation called local dimension which makes use of partial linear extensions. While local dimension is bounded above by dimension, they can be arbitrarily far apart as the dimension of the standard example is $n$ while its…

    Submitted 22 March, 2018; originally announced March 2018.

    Comments: 13 pages, 1 figure

    MSC Class: 06A07; 05C70

    Journal ref: European Journal of Combinatorics 86, 1--13, 2020