Skip to main content

Showing 1–10 of 10 results for author: Vaskevicius, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.14480  [pdf, other

    cs.RO cs.CL cs.CV cs.LG

    GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering

    Authors: Saumya Saxena, Blake Buchanan, Chris Paxton, Bingqing Chen, Narunas Vaskevicius, Luigi Palmieri, Jonathan Francis, Oliver Kroemer

    Abstract: In Embodied Question Answering (EQA), agents must explore and develop a semantic understanding of an unseen environment in order to answer a situated question with confidence. This remains a challenging problem in robotics, due to the difficulties in obtaining useful semantic representations, updating these representations online, and leveraging prior world knowledge for efficient exploration and… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: Project website: https://saumyasaxena.github.io/grapheqa

  2. arXiv:2412.13652  [pdf, other

    cs.CV

    RelationField: Relate Anything in Radiance Fields

    Authors: Sebastian Koch, Johanna Wald, Mirco Colosi, Narunas Vaskevicius, Pedro Hermosilla, Federico Tombari, Timo Ropinski

    Abstract: Neural radiance fields are an emerging 3D scene representation and recently even been extended to learn features for scene understanding by distilling open-vocabulary features from vision-language models. However, current method primarily focus on object-centric representations, supporting object segmentation or detection, while understanding semantic relationships between objects remains largely… ▽ More

    Submitted 25 March, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

    Comments: CVPR 2025. Project page: https://relationfield.github.io

  3. arXiv:2411.10175  [pdf, other

    cs.LG cs.AI cs.CV

    The Surprising Ineffectiveness of Pre-Trained Visual Representations for Model-Based Reinforcement Learning

    Authors: Moritz Schneider, Robert Krug, Narunas Vaskevicius, Luigi Palmieri, Joschka Boedecker

    Abstract: Visual Reinforcement Learning (RL) methods often require extensive amounts of data. As opposed to model-free RL, model-based RL (MBRL) offers a potential solution with efficient data utilization through planning. Additionally, RL lacks generalization capabilities for real-world tasks. Prior work has shown that incorporating pre-trained visual representations (PVRs) enhances sample efficiency and g… ▽ More

    Submitted 15 January, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

    Comments: Published at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024). Project page: https://schneimo.com/pvr4mbrl/

  4. arXiv:2402.12259  [pdf, other

    cs.CV

    Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships

    Authors: Sebastian Koch, Narunas Vaskevicius, Mirco Colosi, Pedro Hermosilla, Timo Ropinski

    Abstract: Current approaches for 3D scene graph prediction rely on labeled datasets to train models for a fixed set of known object classes and relationship categories. We present Open3DSG, an alternative approach to learn 3D scene graph prediction in an open world without requiring labeled scene graph data. We co-embed the features from a 3D scene graph prediction backbone with the feature space of powerfu… ▽ More

    Submitted 1 April, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: CVPR 2024. Project page: https://kochsebastian.com/open3dsg

  5. arXiv:2310.16494  [pdf, other

    cs.CV

    Lang3DSG: Language-based contrastive pre-training for 3D Scene Graph prediction

    Authors: Sebastian Koch, Pedro Hermosilla, Narunas Vaskevicius, Mirco Colosi, Timo Ropinski

    Abstract: D scene graphs are an emerging 3D scene representation, that models both the objects present in the scene as well as their relationships. However, learning 3D scene graphs is a challenging task because it requires not only object labels but also relationship annotations, which are very scarce in datasets. While it is widely accepted that pre-training is an effective approach to improve model perfo… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 3DV 2024. Project page: https://kochsebastian.com/lang3dsg

  6. arXiv:2309.15702  [pdf, other

    cs.CV

    SGRec3D: Self-Supervised 3D Scene Graph Learning via Object-Level Scene Reconstruction

    Authors: Sebastian Koch, Pedro Hermosilla, Narunas Vaskevicius, Mirco Colosi, Timo Ropinski

    Abstract: In the field of 3D scene understanding, 3D scene graphs have emerged as a new scene representation that combines geometric and semantic information about objects and their relationships. However, learning semantic 3D scene graphs in a fully supervised manner is inherently difficult as it requires not only object-level annotations but also relationship labels. While pre-training approaches have hel… ▽ More

    Submitted 6 November, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: WACV 2024, Project page: https://kochsebastian.com/sgrec3d

  7. arXiv:2210.08952  [pdf, other

    cs.RO cs.CV

    Predicting Dense and Context-aware Cost Maps for Semantic Robot Navigation

    Authors: Yash Goel, Narunas Vaskevicius, Luigi Palmieri, Nived Chebrolu, Cyrill Stachniss

    Abstract: We investigate the task of object goal navigation in unknown environments where the target is specified by a semantic label (e.g. find a couch). Such a navigation task is especially challenging as it requires understanding of semantic context in diverse settings. Most of the prior work tackles this problem under the assumption of a discrete action policy whereas we present an approach with continu… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted at IROS PNARUDE(Perception and Navigation for Autonomous Robotics in Unstructured and Dynamic Environments) Workshop 2022

  8. arXiv:2108.01495  [pdf, other

    cs.RO cs.CV

    Cross-Modal Analysis of Human Detection for Robotics: An Industrial Case Study

    Authors: Timm Linder, Narunas Vaskevicius, Robert Schirmer, Kai O. Arras

    Abstract: Advances in sensing and learning algorithms have led to increasingly mature solutions for human detection by robots, particularly in selected use-cases such as pedestrian detection for self-driving cars or close-range person detection in consumer settings. Despite this progress, the simple question "which sensor-algorithm combination is best suited for a person detection task at hand?" remains har… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: Accepted for publication at 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  9. arXiv:1908.00151  [pdf, other

    cs.CV stat.ML

    Multi-path Learning for Object Pose Estimation Across Domains

    Authors: Martin Sundermeyer, Maximilian Durner, En Yen Puang, Zoltan-Csaba Marton, Narunas Vaskevicius, Kai O. Arras, Rudolph Triebel

    Abstract: We introduce a scalable approach for object pose estimation trained on simulated RGB views of multiple 3D models together. We learn an encoding of object views that does not only describe an implicit orientation of all objects seen during training, but can also relate views of untrained objects. Our single-encoder-multi-decoder network is trained using a technique we denote "multi-path learning":… ▽ More

    Submitted 3 April, 2020; v1 submitted 31 July, 2019; originally announced August 2019.

    Comments: To appear at CVPR 2020; Code will be available here: https://github.com/DLR-RM/AugmentedAutoencoder/tree/multipath

  10. arXiv:1605.04177  [pdf, other

    cs.RO

    Knowledge-Enabled Robotic Agents for Shelf Replenishment in Cluttered Retail Environments

    Authors: Jan Winkler, Ferenc Balint-Benczedi, Thiemo Wiedemeyer, Michael Beetz, Narunas Vaskevicius, Christian A. Mueller, Tobias Fromm, Andreas Birk

    Abstract: Autonomous robots in unstructured and dynamically changing retail environments have to master complex perception, knowledgeprocessing, and manipulation tasks. To enable them to act competently, we propose a framework based on three core components: (o) a knowledge-enabled perception system, capable of combining diverse information sources to cope with occlusions and stacked objects with a variety… ▽ More

    Submitted 13 May, 2016; originally announced May 2016.

    Comments: published in the proceedings of AAMAS 2016 as an extended abstract

    ACM Class: I.2.10

    Journal ref: International Conference on Autonomous Agents and Multiagent Systems, 2016