Skip to main content

Showing 1–5 of 5 results for author: Artacho, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.18944  [pdf, other

    cs.CV

    Waterfall Transformer for Multi-person Pose Estimation

    Authors: Navin Ranjan, Bruno Artacho, Andreas Savakis

    Abstract: We propose the Waterfall Transformer architecture for Pose estimation (WTPose), a single-pass, end-to-end trainable framework designed for multi-person pose estimation. Our framework leverages a transformer-based waterfall module that generates multi-scale feature maps from various backbone stages. The module performs filtering in the cascade architecture to expand the receptive fields and to capt… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

  2. arXiv:2112.10716  [pdf, other

    cs.CV

    BAPose: Bottom-Up Pose Estimation with Disentangled Waterfall Representations

    Authors: Bruno Artacho, Andreas Savakis

    Abstract: We propose BAPose, a novel bottom-up approach that achieves state-of-the-art results for multi-person pose estimation. Our end-to-end trainable framework leverages a disentangled multi-scale waterfall architecture and incorporates adaptive convolutions to infer keypoints more precisely in crowded scenes with occlusions. The multi-scale representations, obtained by the disentangled waterfall module… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

  3. arXiv:2103.10180  [pdf, other

    cs.CV cs.LG eess.IV

    OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation

    Authors: Bruno Artacho, Andreas Savakis

    Abstract: We propose OmniPose, a single-pass, end-to-end trainable framework, that achieves state-of-the-art results for multi-person pose estimation. Using a novel waterfall module, the OmniPose architecture leverages multi-scale feature representations that increase the effectiveness of backbone feature extractors, without the need for post-processing. OmniPose incorporates contextual information across s… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: arXiv admin note: text overlap with arXiv:2001.08095

  4. arXiv:2001.08095  [pdf, other

    cs.CV

    UniPose: Unified Human Pose Estimation in Single Images and Videos

    Authors: Bruno Artacho, Andreas Savakis

    Abstract: We propose UniPose, a unified framework for human pose estimation, based on our "Waterfall" Atrous Spatial Pooling architecture, that achieves state-of-art-results on several pose estimation metrics. Current pose estimation methods utilizing standard CNN architectures heavily rely on statistical postprocessing or predefined anchor poses for joint localization. UniPose incorporates contextual segme… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

  5. Waterfall Atrous Spatial Pooling Architecture for Efficient Semantic Segmentation

    Authors: Bruno Artacho, Andreas Savakis

    Abstract: We propose a new efficient architecture for semantic segmentation, based on a "Waterfall" Atrous Spatial Pooling architecture, that achieves a considerable accuracy increase while decreasing the number of network parameters and memory footprint. The proposed Waterfall architecture leverages the efficiency of progressive filtering in the cascade architecture while maintaining multiscale fields-of-v… ▽ More

    Submitted 6 December, 2019; originally announced December 2019.

    Comments: 17 pages, 11 figures

    Journal ref: Sensors, 19(24), 2019