Skip to main content

Showing 1–30 of 30 results for author: Miraldo, P

.
  1. arXiv:2506.08635  [pdf, ps, other

    cs.CV

    SurfR: Surface Reconstruction with Multi-scale Attention

    Authors: Siddhant Ranade, Gonçalo Dias Pais, Ross Tyler Whitaker, Jacinto C. Nascimento, Pedro Miraldo, Srikumar Ramalingam

    Abstract: We propose a fast and accurate surface reconstruction algorithm for unorganized point clouds using an implicit representation. Recent learning methods are either single-object representations with small neural models that allow for high surface details but require per-object training or generalized representations that require larger models and generalize to newer shapes but lack details, and infe… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: Accepted in 3DV 2025

    Journal ref: International Conference on 3D Vision 2025

  2. A Probability-guided Sampler for Neural Implicit Surface Rendering

    Authors: Gonçalo Dias Pais, Valter Piedade, Moitreya Chatterjee, Marcus Greiff, Pedro Miraldo

    Abstract: Several variants of Neural Radiance Fields (NeRFs) have significantly improved the accuracy of synthesized images and surface reconstruction of 3D scenes/objects. In all of these methods, a key characteristic is that none can train the neural network with every possible input data, specifically, every pixel and potential 3D point along the projection rays due to scalability issues. While vanilla N… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: Accepted in ECCV 2024

    Journal ref: European Conference on Computer Vision 2024 (pp. 164-182)

  3. arXiv:2504.20222  [pdf, other

    cs.CV

    FreBIS: Frequency-Based Stratification for Neural Implicit Surface Representations

    Authors: Naoko Sawada, Pedro Miraldo, Suhas Lohit, Tim K. Marks, Moitreya Chatterjee

    Abstract: Neural implicit surface representation techniques are in high demand for advancing technologies in augmented reality/virtual reality, digital twins, autonomous navigation, and many other fields. With their ability to model object surfaces in a scene as a continuous function, such techniques have made remarkable strides recently, especially over classical 3D surface reconstruction methods, such as… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: Accepted to CVPR 2025 CV4Metaverse Workshop

  4. arXiv:2406.03723  [pdf, other

    cs.CV cs.GR cs.MM

    Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling

    Authors: Xinhang Liu, Yu-Wing Tai, Chi-Keung Tang, Pedro Miraldo, Suhas Lohit, Moitreya Chatterjee

    Abstract: Extensions of Neural Radiance Fields (NeRFs) to model dynamic scenes have enabled their near photo-realistic, free-viewpoint rendering. Although these methods have shown some potential in creating immersive experiences, two drawbacks limit their ubiquity: (i) a significant reduction in reconstruction quality when the computing budget is limited, and (ii) a lack of semantic understanding of the und… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Paper accepted to IEEE/CVF CVPR 2024 (Spotlight). Work done when XL was an intern at MERL. Project Page Link: https://merl.com/research/highlights/gear-nerf

    ACM Class: I.2.10

  5. arXiv:2402.06752  [pdf, other

    cs.CV cs.GR

    Oriented-grid Encoder for 3D Implicit Representations

    Authors: Arihant Gaur, G. Dias Pais, Pedro Miraldo

    Abstract: Encoding 3D points is one of the primary steps in learning-based implicit scene representation. Using features that gather information from neighbors with multi-resolution grids has proven to be the best geometric encoder for this task. However, prior techniques do not exploit some characteristics of most objects or scenes, such as surface normals and local smoothness. This paper is the first to e… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 3DV 2024 paper

  6. arXiv:2309.08690  [pdf, other

    cs.CV cs.RO

    BANSAC: A dynamic BAyesian Network for adaptive SAmple Consensus

    Authors: Valter Piedade, Pedro Miraldo

    Abstract: RANSAC-based algorithms are the standard techniques for robust estimation in computer vision. These algorithms are iterative and computationally expensive; they alternate between random sampling of data, computing hypotheses, and running inlier counting. Many authors tried different approaches to improve efficiency. One of the major improvements is having a guided sampling, letting the RANSAC cycl… ▽ More

    Submitted 25 September, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: ICCV 2023 paper

  7. arXiv:2309.08588  [pdf, other

    cs.CV cs.RO

    Robust Frame-to-Frame Camera Rotation Estimation in Crowded Scenes

    Authors: Fabien Delattre, David Dirnfeld, Phat Nguyen, Stephen Scarano, Michael J. Jones, Pedro Miraldo, Erik Learned-Miller

    Abstract: We present an approach to estimating camera rotation in crowded, real-world scenes from handheld monocular video. While camera rotation estimation is a well-studied problem, no previous methods exhibit both high accuracy and acceptable speed in this setting. Because the setting is not addressed well by other datasets, we provide a new dataset and benchmark, with high-accuracy, rigorously verified… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Published at ICCV 2023

  8. An observer cascade for velocity and multiple line estimation

    Authors: André Mateus, Pedro U. Lima, Pedro Miraldo

    Abstract: Previous incremental estimation methods consider estimating a single line, requiring as many observers as the number of lines to be mapped. This leads to the need for having at least $4N$ state variables, with $N$ being the number of lines. This paper presents the first approach for multi-line incremental estimation. Since lines are common in structured environments, we aim to exploit that structu… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: Accepted for publication in 2022 IEEE's International Conference on Robotics and Automation (ICRA)

  9. arXiv:2109.00505  [pdf, other

    math.NA cs.RO

    Solving the Discrete Euler-Arnold Equations for the Generalized Rigid Body Motion

    Authors: Joao R. Cardoso, Pedro Miraldo

    Abstract: We propose three iterative methods for solving the Moser-Veselov equation, which arises in the discretization of the Euler-Arnold differential equations governing the motion of a generalized rigid body. We start by formulating the problem as an optimization problem with orthogonal constraints and proving that the objective function is convex. Then, using techniques from optimization on Riemannian… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

    Journal ref: Journal of Computational and Applied Mathematics, 2021

  10. On Incremental Structure-from-Motion using Lines

    Authors: André Mateus, Omar Tahri, A. Pedro Aguiar, Pedro U. Lima, Pedro Miraldo

    Abstract: Humans tend to build environments with structure, which consists of mainly planar surfaces. From the intersection of planar surfaces arise straight lines. Lines have more degrees-of-freedom than points. Thus, line-based Structure-from-Motion (SfM) provides more information about the environment. In this paper, we present solutions for SfM using lines, namely, incremental SfM. These approaches cons… ▽ More

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: To appear in IEEE Transactions on Robotics \c{opyright} 2021 IEEE

  11. arXiv:2010.02516  [pdf, other

    cs.CV cs.RO

    Mapping of Sparse 3D Data using Alternating Projection

    Authors: Siddhant Ranade, Xin Yu, Shantnu Kakkar, Pedro Miraldo, Srikumar Ramalingam

    Abstract: We propose a novel technique to register sparse 3D scans in the absence of texture. While existing methods such as KinectFusion or Iterative Closest Points (ICP) heavily rely on dense point clouds, this task is particularly challenging under sparse conditions without RGB data. Sparse texture-less data does not come with high-quality boundary signal, and this prohibits the use of correspondences fr… ▽ More

    Submitted 9 October, 2020; v1 submitted 4 October, 2020; originally announced October 2020.

    Comments: ACCV2020 oral. This article supersedes arXiv:1906.05888

  12. arXiv:2003.07137  [pdf, other

    cs.RO cs.CV

    Active Depth Estimation: Stability Analysis and its Applications

    Authors: Romulo T. Rodrigues, Pedro Miraldo, Dimos V. Dimarogonas, A. Pedro Aguiar

    Abstract: Recovering the 3D structure of the surrounding environment is an essential task in any vision-controlled Structure-from-Motion (SfM) scheme. This paper focuses on the theoretical properties of the SfM, known as the incremental active depth estimation. The term incremental stands for estimating the 3D structure of the scene over a chronological sequence of image frames. Active means that the camera… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: 7 pages, 3 figures, conference

    Journal ref: International Conference on Robotics and Automation (ICRA), 2020

  13. arXiv:1908.00309  [pdf, other

    cs.RO cs.CV

    A Framework for Depth Estimation and Relative Localization of Ground Robots using Computer Vision

    Authors: Romulo T. Rodrigues, Pedro Miraldo, Dimos V. Dimarogonas, A. Pedro Aguiar

    Abstract: The 3D depth estimation and relative pose estimation problem within a decentralized architecture is a challenging problem that arises in missions that require coordination among multiple vision-controlled robots. The depth estimation problem aims at recovering the 3D information of the environment. The relative localization problem consists of estimating the relative pose between two robots, by se… ▽ More

    Submitted 1 August, 2019; originally announced August 2019.

    Comments: 6 pages, 7 figures, conference

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2019

  14. arXiv:1906.05888  [pdf, other

    cs.CV

    Can generalised relative pose estimation solve sparse 3D registration?

    Authors: Siddhant Ranade, Xin Yu, Shantnu Kakkar, Pedro Miraldo, Srikumar Ramalingam

    Abstract: Popular 3D scan registration projects, such as Stanford digital Michelangelo or KinectFusion, exploit the high-resolution sensor data for scan alignment. It is particularly challenging to solve the registration of sparse 3D scans in the absence of RGB components. In this case, we can not establish point correspondences since the same 3D point cannot be captured in two successive scans. In contrast… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

  15. arXiv:1904.04858  [pdf, other

    cs.CV cs.RO

    POSEAMM: A Unified Framework for Solving Pose Problems using an Alternating Minimization Method

    Authors: Joao Campos, Joao R. Cardoso, Pedro Miraldo

    Abstract: Pose estimation is one of the most important problems in computer vision. It can be divided in two different categories -- absolute and relative -- and may involve two different types of camera models: central and non-central. State-of-the-art methods have been designed to solve separately these problems. This paper presents a unified framework that is able to solve any pose problem by alternating… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: 12 pages, 5 figures

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2019

  16. arXiv:1904.03941  [pdf, other

    cs.CV

    Minimal Solvers for Mini-Loop Closures in 3D Multi-Scan Alignment

    Authors: Pedro Miraldo, Surojit Saha, Srikumar Ramalingam

    Abstract: 3D scan registration is a classical, yet a highly useful problem in the context of 3D sensors such as Kinect and Velodyne. While there are several existing methods, the techniques are usually incremental where adjacent scans are registered first to obtain the initial poses, followed by motion averaging and bundle-adjustment refinement. In this paper, we take a different approach and develop minima… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: 10 pages, 5 figures, 5 tables

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019

  17. arXiv:1904.01701  [pdf, other

    cs.CV

    3DRegNet: A Deep Neural Network for 3D Point Registration

    Authors: G. Dias Pais, Srikumar Ramalingam, Venu Madhav Govindu, Jacinto C. Nascimento, Rama Chellappa, Pedro Miraldo

    Abstract: We present 3DRegNet, a novel deep learning architecture for the registration of 3D scans. Given a set of 3D point correspondences, we build a deep neural network to address the following two challenges: (i) classification of the point correspondences into inliers/outliers, and (ii) regression of the motion parameters that align the scans into a common reference frame. With regard to regression, we… ▽ More

    Submitted 7 April, 2020; v1 submitted 2 April, 2019; originally announced April 2019.

    Comments: 15 pages, 8 figures, 6 tables

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020

  18. arXiv:1903.00676  [pdf, other

    cs.CV cs.RO

    OmniDRL: Robust Pedestrian Detection using Deep Reinforcement Learning on Omnidirectional Cameras

    Authors: G. Dias Pais, Tiago J. Dias, Jacinto C. Nascimento, Pedro Miraldo

    Abstract: Pedestrian detection is one of the most explored topics in computer vision and robotics. The use of deep learning methods allowed the development of new and highly competitive algorithms. Deep Reinforcement Learning has proved to be within the state-of-the-art in terms of both detection in perspective cameras and robotics applications. However, for detection in omnidirectional cameras, the literat… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

    Comments: Accepted in 2019 IEEE Int'l Conf. Robotics and Automation (ICRA)

  19. arXiv:1902.00473  [pdf, other

    cs.RO

    Active Estimation of 3D Lines in Spherical Coordinates

    Authors: André Mateus, Omar Tahri, Pedro Miraldo

    Abstract: Straight lines are common features in human made environments, which makes them a frequently explored feature for control applications. Many control schemes, like Visual Servoing, require the 3D parameters of the features to be estimated. In order to obtain the 3D structure of lines, a nonlinear observer is proposed. However, to guarantee convergence, the dynamical system must be coupled with an a… ▽ More

    Submitted 21 March, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: Accepted in 2019 American Control Conference (ACC) (Final Version)

  20. arXiv:1807.09970  [pdf, other

    cs.CV cs.RO

    A Minimal Closed-Form Solution for Multi-Perspective Pose Estimation using Points and Lines

    Authors: Pedro Miraldo, Tiago Dias, Srikumar Ramalingam

    Abstract: We propose a minimal solution for pose estimation using both points and lines for a multi-perspective camera. In this paper, we treat the multi-perspective camera as a collection of rigidly attached perspective cameras. These type of imaging devices are useful for several computer vision applications that require a large coverage such as surveillance, self-driving cars, and motion-capture studios.… ▽ More

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: 22 pages, 6 figures

    Journal ref: European Conference on Computer Vision (ECCV), 2018

  21. arXiv:1807.00753  [pdf, other

    cs.RO

    Active Structure-from-Motion for 3D Straight Lines

    Authors: André Mateus, Omar Tahri, Pedro Miraldo

    Abstract: A reliable estimation of 3D parameters is a must for several applications like planning and control. Included in the latter is the Image-Based Visual Servoing, whose control scheme depends directly on 3D parameters e.g. depth of points, and depth and direction of 3D straight lines. Recently a framework for Active Structure-from-Motion was proposed, addressing the former feature type. However, stra… ▽ More

    Submitted 12 December, 2018; v1 submitted 2 July, 2018; originally announced July 2018.

    Comments: 8 pages, 4 figures, Final Version of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018

  22. arXiv:1804.09460  [pdf, other

    cs.CV cs.RO

    Analytical Modeling of Vanishing Points and Curves in Catadioptric Cameras

    Authors: Pedro Miraldo, Francisco Eiras, Srikumar Ramalingam

    Abstract: Vanishing points and vanishing lines are classical geometrical concepts in perspective cameras that have a lineage dating back to 3 centuries. A vanishing point is a point on the image plane where parallel lines in 3D space appear to converge, whereas a vanishing line passes through 2 or more vanishing points. While such concepts are simple and intuitive in perspective cameras, their counterparts… ▽ More

    Submitted 25 April, 2018; originally announced April 2018.

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018

  23. arXiv:1801.07249  [pdf, other

    cs.RO

    Low-level Active Visual Navigation: Increasing robustness of vision-based localization using potential fields

    Authors: Romulo T. Rodrigues, Meysam Basiri, A. Pedro Aguiar, Pedro Miraldo

    Abstract: This paper proposes a low-level visual navigation algorithm to improve visual localization of a mobile robot. The algorithm, based on artificial potential fields, associates each feature in the current image frame with an attractive or neutral potential energy, with the objective of generating a control action that drives the vehicle towards the goal, while still favoring feature rich areas within… ▽ More

    Submitted 23 March, 2018; v1 submitted 21 January, 2018; originally announced January 2018.

    Comments: accepted for ICRA 2018. arXiv admin note: text overlap with arXiv:1709.04687

  24. arXiv:1709.06451  [pdf, other

    cs.CV

    3D Reconstruction with Low Resolution, Small Baseline and High Radial Distortion Stereo Images

    Authors: Tiago Dias, Helder Araujo, Pedro Miraldo

    Abstract: In this paper we analyze and compare approaches for 3D reconstruction from low-resolution (250x250), high radial distortion stereo images, which are acquired with small baseline (approximately 1mm). These images are acquired with the system NanEye Stereo manufactured by CMOSIS/AWAIBA. These stereo cameras have also small apertures, which means that high levels of illumination are required. The goa… ▽ More

    Submitted 19 September, 2017; originally announced September 2017.

    Journal ref: ACM Int'l Conf. Distributed Smart Cameras (ICDSC), 2016

  25. arXiv:1709.06328  [pdf, other

    cs.CV

    On the Generalized Essential Matrix Correction: An efficient solution to the problem and its applications

    Authors: Pedro Miraldo, Joao R. Cardoso

    Abstract: This paper addresses the problem of finding the closest generalized essential matrix from a given $6\times 6$ matrix, with respect to the Frobenius norm. To the best of our knowledge, this nonlinear constrained optimization problem has not been addressed in the literature yet. Although it can be solved directly, it involves a large number of constraints, and any optimization method to solve it wou… ▽ More

    Submitted 16 March, 2020; v1 submitted 19 September, 2017; originally announced September 2017.

    Comments: 14 pages, 7 figures, journal

    Journal ref: Journal of Mathematical Imaging and Vision, 2020

  26. arXiv:1709.04687  [pdf, other

    cs.RO

    Feature Based Potential Field for Low-level Active Visual Navigation

    Authors: Rômulo T. Rodrigues, Meysam Basiri, A. Pedro Aguiar, Pedro Miraldo

    Abstract: This paper proposes a novel solution for improving visual localization in an active fashion. The solution, based on artificial potential field, associates each feature in the current image frame with an attractive or neutral potential energy. The resultant action drives the vehicle towards the goal, while still favoring feature rich areas. Experimental results with a mini quadrotor equipped with a… ▽ More

    Submitted 14 September, 2017; originally announced September 2017.

    Comments: 11 pages, 6 figures, to published on ROBOT'17: Third Iberian Robotics Conference

  27. arXiv:1607.04441  [pdf, other

    cs.RO cs.CV

    Efficient and Robust Pedestrian Detection using Deep Learning for Human-Aware Navigation

    Authors: Andre Mateus, David Ribeiro, Pedro Miraldo, Jacinto C. Nascimento

    Abstract: This paper addresses the problem of Human-Aware Navigation (HAN), using multi camera sensors to implement a vision-based person tracking system. The main contributions of this paper are as follows: a novel and efficient Deep Learning person detection and a standardization of human-aware constraints. In the first stage of the approach, we propose to cascade the Aggregate Channel Features (ACF) dete… ▽ More

    Submitted 13 December, 2018; v1 submitted 15 July, 2016; originally announced July 2016.

    Comments: Accepted in Robotics and Autonomous Systems

  28. arXiv:1607.04436  [pdf, other

    cs.RO cs.CV

    A Real-Time Deep Learning Pedestrian Detector for Robot Navigation

    Authors: David Ribeiro, Andre Mateus, Pedro Miraldo, Jacinto C. Nascimento

    Abstract: A real-time Deep Learning based method for Pedestrian Detection (PD) is applied to the Human-Aware robot navigation problem. The pedestrian detector combines the Aggregate Channel Features (ACF) detector with a deep Convolutional Neural Network (CNN) in order to obtain fast and accurate performance. Our solution is firstly evaluated using a set of real images taken from onboard and offboard camera… ▽ More

    Submitted 19 September, 2017; v1 submitted 15 July, 2016; originally announced July 2016.

    Journal ref: IEEE Int'l Conf. Autonomous Robot Systems and Competitions (ICARSC), 2017

  29. arXiv:1607.02290  [pdf, other

    cs.RO cs.CV

    Non-Central Catadioptric Cameras Pose Estimation using 3D Lines

    Authors: Andre Mateus, Pedro Miraldo, Pedro U. Lima

    Abstract: In this article we purpose a novel method for planar pose estimation of mobile robots. This method is based on an analytic solution (which we derived) for the projection of 3D straight lines, onto the mirror of Non-Central Catadioptric Cameras (NCCS). The resulting solution is rewritten as a function of the rotation and translation parameters, which is then used as an error function for a set of m… ▽ More

    Submitted 8 July, 2016; originally announced July 2016.

  30. arXiv:1602.05990  [pdf, ps, other

    cs.CV cs.RO

    Plücker Correction Problem: Analysis and Improvements in Efficiency

    Authors: João R. Cardoso, Pedro Miraldo, Helder Araujo

    Abstract: A given six dimensional vector represents a 3D straight line in Plucker coordinates if its coordinates satisfy the Klein quadric constraint. In many problems aiming to find the Plucker coordinates of lines, noise in the data and other type of errors contribute for obtaining 6D vectors that do not correspond to lines, because of that constraint. A common procedure to overcome this drawback is to… ▽ More

    Submitted 18 February, 2016; originally announced February 2016.