Skip to main content

Showing 1–10 of 10 results for author: Ratsch, M

.
  1. NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model

    Authors: Yuzhi Lai, Shenghai Yuan, Youssef Nassar, Mingyu Fan, Thomas Weber, Matthias Rätsch

    Abstract: Effective Human-Robot Interaction (HRI) is crucial for future service robots in aging societies. Existing solutions are biased toward only well-trained objects, creating a gap when dealing with new objects. Currently, HRI systems using predefined gestures or language tokens for pretrained objects pose challenges for all individuals, especially elderly ones. These challenges include difficulties in… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: This work has been accepted for publication in ESWA @ 2025 Elsevier. Personal use of this material is permitted. Permission from Elsevier must be obtained for all other uses, including reprinting/redistribution, creating new works, or reuse of any copyrighted components of this work in other media

  2. Natural Multimodal Fusion-Based Human-Robot Interaction: Application With Voice and Deictic Posture via Large Language Model

    Authors: Yuzhi Lai, Shenghai Yuan, Youssef Nassar, Mingyu Fan, Atmaraaj Gopal, Arihiro Yorita, Naoyuki Kubota, Matthias Rätsch

    Abstract: Translating human intent into robot commands is crucial for the future of service robots in an aging society. Existing Human-Robot Interaction (HRI) systems relying on gestures or verbal commands are impractical for the elderly due to difficulties with complex syntax or sign language. To address the challenge, this paper introduces a multi-modal interaction framework that combines voice and deicti… ▽ More

    Submitted 4 April, 2025; v1 submitted 1 January, 2025; originally announced January 2025.

    Comments: Accepted for publication by IEEE Robotics & Automation Magazine

  3. arXiv:2411.05627  [pdf, other

    math.OC eess.SY

    Large problems are not necessarily hard: A case study on distributed NMPC paying off

    Authors: Gösta Stomberg, Maurice Raetsch, Alexander Engelmann, Timm Faulwasser

    Abstract: A key motivation in the development of Distributed Model Predictive Control (DMPC) is to accelerate centralized Model Predictive Control (MPC) for large-scale systems. DMPC has the prospect of scaling well by parallelizing computations among subsystems. However, communication delays may deteriorate the performance of decentralized optimization, if excessively many iterations are required per contr… ▽ More

    Submitted 15 April, 2025; v1 submitted 8 November, 2024; originally announced November 2024.

  4. arXiv:2111.05149  [pdf, other

    cs.CV cs.LG

    Ethically aligned Deep Learning: Unbiased Facial Aesthetic Prediction

    Authors: Michael Danner, Thomas Weber, Leping Peng, Tobias Gerlach, Xueping Su, Matthias Rätsch

    Abstract: Facial beauty prediction (FBP) aims to develop a machine that automatically makes facial attractiveness assessment. In the past those results were highly correlated with human ratings, therefore also with their bias in annotating. As artificial intelligence can have racist and discriminatory tendencies, the cause of skews in the data must be identified. Development of training data and AI algorith… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Peer reviewed and accepted at CEPE/IACAP 2021 as Extended Abstract

  5. arXiv:1803.05536  [pdf, other

    cs.CV

    Evaluation of Dense 3D Reconstruction from 2D Face Images in the Wild

    Authors: Zhen-Hua Feng, Patrik Huber, Josef Kittler, Peter JB Hancock, Xiao-Jun Wu, Qijun Zhao, Paul Koppen, Matthias Rätsch

    Abstract: This paper investigates the evaluation of dense 3D face reconstruction from a single 2D image in the wild. To this end, we organise a competition that provides a new benchmark dataset that contains 2000 2D facial images of 135 subjects as well as their 3D ground truth face scans. In contrast to previous competitions or challenges, the aim of this new benchmark dataset is to evaluate the accuracy o… ▽ More

    Submitted 20 April, 2018; v1 submitted 14 March, 2018; originally announced March 2018.

    Comments: 8 pages

  6. arXiv:1803.02257  [pdf

    cs.CV cs.RO

    Methodology to analyze the accuracy of 3D objects reconstructed with collaborative robot based monocular LSD-SLAM

    Authors: Sergey Triputen, Atmaraaj Gopal, Thomas Weber, Christian Hofert, Kristiaan Schreve, Matthias Ratsch

    Abstract: SLAM systems are mainly applied for robot navigation while research on feasibility for motion planning with SLAM for tasks like bin-picking, is scarce. Accurate 3D reconstruction of objects and environments is important for planning motion and computing optimal gripper pose to grasp objects. In this work, we propose the methods to analyze the accuracy of a 3D environment reconstructed using a LSD-… ▽ More

    Submitted 6 March, 2018; originally announced March 2018.

    Comments: 5 pages, 5 figures, 2018 International Conference on Intelligent Autonomous Systems (ICoIAS 2018)

  7. arXiv:1707.05982  [pdf, other

    cs.RO

    Closed-form Solution for IMU based LSD-SLAM Point Cloud Conversion into the Scaled 3D World Environment

    Authors: Sergey Triputen, Kristiaan Schreve, Viktor Tkachev, Matthias Ratsch

    Abstract: SLAM is a very popular research stream in computer vision and robotics nowadays. For more effective SLAM implementation it is necessary to have reliable informa- tion about the environment, also the data should be aligned and scaled according to the real world coordinate system. Monocular SLAM research is an attractive sub-stream, because of the low equipment cost, size and weight. In this paper w… ▽ More

    Submitted 19 July, 2017; originally announced July 2017.

    Comments: 6 pages, 8 figures

  8. arXiv:1606.00474  [pdf, other

    cs.CV cs.HC cs.RO

    A 3D Face Modelling Approach for Pose-Invariant Face Recognition in a Human-Robot Environment

    Authors: Michael Grupp, Philipp Kopp, Patrik Huber, Matthias Rätsch

    Abstract: Face analysis techniques have become a crucial component of human-machine interaction in the fields of assistive and humanoid robotics. However, the variations in head-pose that arise naturally in these environments are still a great challenge. In this paper, we present a real-time capable 3D face modelling framework for 2D in-the-wild images that is applicable for robotics. The fitting of the 3D… ▽ More

    Submitted 1 June, 2016; originally announced June 2016.

    MSC Class: 68T45; 68T40; 68T10 ACM Class: I.2.9; I.2.10; I.5.4

  9. 3D Face Tracking and Texture Fusion in the Wild

    Authors: Patrik Huber, Philipp Kopp, Matthias Rätsch, William Christmas, Josef Kittler

    Abstract: We present a fully automatic approach to real-time 3D face reconstruction from monocular in-the-wild videos. With the use of a cascaded-regressor based face tracking and a 3D Morphable Face Model shape fitting, we obtain a semi-dense 3D face shape. We further use the texture information from multiple frames to build a holistic 3D face representation from the video frames. Our system is able to cap… ▽ More

    Submitted 22 May, 2016; originally announced May 2016.

    MSC Class: 68T45 ACM Class: I.4.8; I.4.9; I.2.10

    Journal ref: IEEE Signal Processing Letters (Volume: 24, Issue: 4, April 2017)

  10. Fitting 3D Morphable Models using Local Features

    Authors: Patrik Huber, Zhen-Hua Feng, William Christmas, Josef Kittler, Matthias Rätsch

    Abstract: In this paper, we propose a novel fitting method that uses local image features to fit a 3D Morphable Model to 2D images. To overcome the obstacle of optimising a cost function that contains a non-differentiable feature extraction operator, we use a learning-based cascaded regression method that learns the gradient direction from data. The method allows to simultaneously solve for shape and pose p… ▽ More

    Submitted 8 March, 2015; originally announced March 2015.

    Comments: Submitted to ICIP 2015; 4 pages, 4 figures

    MSC Class: 68T45 ACM Class: I.4.8; I.2.10

    Journal ref: Proceedings of the IEEE International Conference on Image Processing (ICIP) 2015, pages 1195-1199