Skip to main content

Showing 1–7 of 7 results for author: Gouda, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.02812  [pdf, other

    cs.CV

    BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation

    Authors: Van Nguyen Nguyen, Stephen Tyree, Andrew Guo, Mederic Fourmy, Anas Gouda, Taeyeop Lee, Sungphill Moon, Hyeontae Son, Lukas Ranftl, Jonathan Tremblay, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Stan Birchfield, Jiri Matas, Yann Labbe, Martin Sundermeyer, Tomas Hodan

    Abstract: We present the evaluation methodology, datasets and results of the BOP Challenge 2024, the 6th in a series of public competitions organized to capture the state of the art in 6D object pose estimation and related tasks. In 2024, our goal was to transition BOP from lab-like setups to real-world scenarios. First, we introduced new model-free tasks, where no 3D object models are available and methods… ▽ More

    Submitted 23 April, 2025; v1 submitted 3 April, 2025; originally announced April 2025.

    Comments: arXiv admin note: text overlap with arXiv:2403.09799

  2. arXiv:2404.06277  [pdf, other

    cs.CV

    Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping

    Authors: Anas Gouda, Max Schwarz, Christopher Reining, Sven Behnke, Alice Kirchheim

    Abstract: Foundation models are a strong trend in deep learning and computer vision. These models serve as a base for applications as they require minor or no further fine-tuning by developers to integrate into their applications. Foundation models for zero-shot object segmentation such as Segment Anything (SAM) output segmentation masks from images without any further object information. When they are foll… ▽ More

    Submitted 8 July, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted to CASE 2024

  3. arXiv:2305.00718  [pdf

    cs.CV

    Event Camera as Region Proposal Network

    Authors: Shrutarv Awasthi, Anas Gouda, Richard Julian Lodenkaemper, Moritz Roidl

    Abstract: The human eye consists of two types of photoreceptors, rods and cones. Rods are responsible for monochrome vision, and cones for color vision. The number of rods is much higher than the cones, which means that most human vision processing is done in monochrome. An event camera reports the change in pixel intensity and is analogous to rods. Event and color cameras in computer vision are like rods a… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  4. arXiv:2304.02833  [pdf, other

    cs.CV cs.RO

    DoUnseen: Tuning-Free Class-Adaptive Object Detection of Unseen Objects for Robotic Grasping

    Authors: Anas Gouda, Moritz Roidl

    Abstract: How can we segment varying numbers of objects where each specific object represents its own separate class? To make the problem even more realistic, how can we add and delete classes on the fly without retraining or fine-tuning? This is the case of robotic applications where no datasets of the objects exist or application that includes thousands of objects (E.g., in logistics) where it is impossib… ▽ More

    Submitted 27 November, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: presented at RSS 2023 Workshop on Perception and Manipulation Challenges for Warehouse Automation

  5. arXiv:2212.04721  [pdf, other

    cs.LG cs.RO

    A Grid-based Sensor Floor Platform for Robot Localization using Machine Learning

    Authors: Anas Gouda, Danny Heinrich, Mirco Hünnefeld, Irfan Fachrudin Priyanta, Christopher Reining, Moritz Roidl

    Abstract: Wireless Sensor Network (WSN) applications reshape the trend of warehouse monitoring systems allowing them to track and locate massive numbers of logistic entities in real-time. To support the tasks, classic Radio Frequency (RF)-based localization approaches (e.g. triangulation and trilateration) confront challenges due to multi-path fading and signal loss in noisy warehouse environment. In this p… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: This is a preprint version for IEEE I2MTC 2023

  6. arXiv:2204.13613  [pdf, other

    cs.RO

    DoPose-6D dataset for object segmentation and 6D pose estimation

    Authors: Anas Gouda, Abraham Ghanem, Christopher Reining

    Abstract: Scene understanding is essential in determining how intelligent robotic grasping and manipulation could get. It is a problem that can be approached using different techniques: seen object segmentation, unseen object segmentation, or 6D pose estimation. These techniques can even be extended to multi-view. Most of the work on these problems depends on synthetic datasets due to the lack of real datas… ▽ More

    Submitted 28 November, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: accepted for IEEE ICMLA 2022

  7. arXiv:2010.10340  [pdf, other

    cs.CV cs.LG

    Leveraging SLIC Superpixel Segmentation and Cascaded Ensemble SVM for Fully Automated Mass Detection In Mammograms

    Authors: Jaime Simarro, Zohaib Salahuddin, Ahmed Gouda, Anindo Saha

    Abstract: Identification and segmentation of breast masses in mammograms face complex challenges, owing to the highly variable nature of malignant densities with regards to their shape, contours, texture and orientation. Additionally, classifiers typically suffer from high class imbalance in region candidates, where normal tissue regions vastly outnumber malignant masses. This paper proposes a rigorous segm… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.