Showing 1–10 of 10 results for author: Pontes, J K

  1. arXiv:2407.16789

    cs.CV cs.AI cs.LG

    What Matters in Range View 3D Object Detection

    Authors: Benjamin Wilson, Nicholas Autio Mitchell, Jhony Kaesemodel Pontes, James Hays

    Abstract: Lidar-based perception pipelines rely on 3D object detection models to interpret complex scenes. While multiple representations for lidar exist, the range-view is enticing since it losslessly encodes the entire lidar sensor output. In this work, we achieve state-of-the-art amongst range-view 3D object detection models without using multiple techniques proposed in past range-view literature. We exp…

    Submitted 25 July, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: Fixed broken link

  2. arXiv:2304.09121

    cs.CV

    Fast Neural Scene Flow

    Authors: Xueqian Li, Jianqiao Zheng, Francesco Ferroni, Jhony Kaesemodel Pontes, Simon Lucey

    Abstract: Neural Scene Flow Prior (NSFP) is of significant interest to the vision community due to its inherent robustness to out-of-distribution (OOD) effects and its ability to deal with dense lidar points. The approach utilizes a coordinate neural network to estimate scene flow at runtime, without any training. However, it is up to 100 times slower than current state-of-the-art learning methods. In other…

    Submitted 29 August, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: 17 pages, 11 figures, 6 tables

  3. arXiv:2301.00493

    cs.CV cs.AI cs.LG cs.RO

    Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting

    Authors: Benjamin Wilson, William Qi, Tanmay Agarwal, John Lambert, Jagjeet Singh, Siddhesh Khandelwal, Bowen Pan, Ratnesh Kumar, Andrew Hartnett, Jhony Kaesemodel Pontes, Deva Ramanan, Peter Carr, James Hays

    Abstract: We introduce Argoverse 2 (AV2) - a collection of three datasets for perception and forecasting research in the self-driving domain. The annotated Sensor Dataset contains 1,000 sequences of multimodal data, encompassing high-resolution imagery from seven ring cameras, and two stereo cameras in addition to lidar point clouds, and 6-DOF map-aligned pose. Sequences contain 3D cuboid annotations for 26…

    Submitted 1 January, 2023; originally announced January 2023.

    Comments: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks

  4. arXiv:2111.01253

    cs.CV

    Neural Scene Flow Prior

    Authors: Xueqian Li, Jhony Kaesemodel Pontes, Simon Lucey

    Abstract: Before the deep learning revolution, many perception algorithms were based on runtime optimization in conjunction with a strong prior/regularization penalty. A prime example of this in computer vision is optical and scene flow. Supervised learning has largely displaced the need for explicit regularization. Instead, they rely on large amounts of labeled data to capture prior statistics, which are n…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: accepted by NeurIPS 2021 as "spotlight"

  5. arXiv:2011.00320

    cs.CV cs.LG cs.RO

    Scene Flow from Point Clouds with or without Learning

    Authors: Jhony Kaesemodel Pontes, James Hays, Simon Lucey

    Abstract: Scene flow is the three-dimensional (3D) motion field of a scene. It provides information about the spatial arrangement and rate of change of objects in dynamic environments. Current learning-based approaches seek to estimate the scene flow directly from point clouds and have achieved state-of-the-art performance. However, supervised learning methods are inherently domain specific and require a la…

    Submitted 31 October, 2020; originally announced November 2020.

    Comments: International Conference on 3D Vision (3DV 2020)

  6. arXiv:2008.09527

    cs.CV

    PointNetLK Revisited

    Authors: Xueqian Li, Jhony Kaesemodel Pontes, Simon Lucey

    Abstract: We address the generalization ability of recent learning-based point cloud registration methods. Despite their success, these approaches tend to have poor performance when applied to mismatched conditions that are not well-represented in the training set, such as unseen object categories, different complex scenes, or unknown depth sensors. In these circumstances, it has often been better to rely o…

    Submitted 29 March, 2021; v1 submitted 21 August, 2020; originally announced August 2020.

  7. arXiv:1901.06802

    cs.CV

    Deep Level Sets: Implicit Surface Representations for 3D Shape Inference

    Authors: Mateusz Michalkiewicz, Jhony K. Pontes, Dominic Jack, Mahsa Baktashmotlagh, Anders Eriksson

    Abstract: Existing 3D surface representation approaches are unable to accurately classify pixels and their orientation lying on the boundary of an object. Thus resulting in coarse representations which usually require post-processing steps to extract 3D surface meshes. To overcome this limitation, we propose an end-to-end trainable model that directly predicts implicit surface representations of arbitrary t…

    Submitted 21 January, 2019; originally announced January 2019.

  8. arXiv:1803.10932

    cs.CV

    Learning Free-Form Deformations for 3D Object Reconstruction

    Authors: Dominic Jack, Jhony K. Pontes, Sridha Sridharan, Clinton Fookes, Sareh Shirazi, Frederic Maire, Anders Eriksson

    Abstract: Representing 3D shape in deep learning frameworks in an accurate, efficient and compact manner still remains an open challenge. Most existing work addresses this issue by employing voxel-based representations. While these approaches benefit greatly from advances in computer vision by generalizing 2D convolutions to the 3D setting, they also have several considerable drawbacks. The computational co…

    Submitted 29 March, 2018; originally announced March 2018.

    Comments: 16 pages, 7 figures, 3 tables

    Journal ref: Asian Conference on Computer Vision (ACCV) 2018

  9. arXiv:1711.10669

    cs.CV

    Image2Mesh: A Learning Framework for Single Image 3D Reconstruction

    Authors: Jhony K. Pontes, Chen Kong, Sridha Sridharan, Simon Lucey, Anders Eriksson, Clinton Fookes

    Abstract: One challenge that remains open in 3D deep learning is how to efficiently represent 3D data to feed deep networks. Recent works have relied on volumetric or point cloud representations, but such approaches suffer from a number of issues such as computational complexity, unordered data, and lack of finer geometry. This paper demonstrates that a mesh representation (i.e. vertices and faces to form p…

    Submitted 28 November, 2017; originally announced November 2017.

    Comments: 9 pages, 4 figures

    Journal ref: Asian Conference on Computer Vision (ACCV) 2018

  10. Compact Model Representation for 3D Reconstruction

    Authors: Jhony K. Pontes, Chen Kong, Anders Eriksson, Clinton Fookes, Sridha Sridharan, Simon Lucey

    Abstract: 3D reconstruction from 2D images is a central problem in computer vision. Recent works have been focusing on reconstruction directly from a single image. It is well known however that only one image cannot provide enough information for such a reconstruction. A prior knowledge that has been entertained are 3D CAD models due to its online ubiquity. A fundamental question is how to compactly represe…

    Submitted 23 July, 2017; originally announced July 2017.

    Comments: 9 pages, 6 figures

    Journal ref: 2017 International Conference on 3D Vision (3DV)