Showing 1–4 of 4 results for author: Tarashima, S

  1. arXiv:2503.23529  [pdf, other]

    cs.CV

    ViLAaD: Enhancing "Attracting and Dispersing" Source-Free Domain Adaptation with Vision-and-Language Model

    Authors: Shuhei Tarashima, Xinqi Shu, Norio Tagawa

    Abstract: Source-Free Domain Adaptation (SFDA) aims to adapt a pre-trained source model to a target dataset from a different domain without access to the source data. Conventional SFDA methods are limited by the information encoded in the pre-trained source model and the unlabeled target data. Recently, approaches leveraging auxiliary resources have emerged, yet remain in their early stages, offering ample…

    Submitted 30 March, 2025; originally announced March 2025.

    Comments: 15 pages

  2. arXiv:2404.10272  [pdf, other]

    cs.CV

    Plug-and-Play Acceleration of Occupancy Grid-based NeRF Rendering using VDB Grid and Hierarchical Ray Traversal

    Authors: Yoshio Kato, Shuhei Tarashima

    Abstract: Transmittance estimators such as Occupancy Grid (OG) can accelerate the training and rendering of Neural Radiance Field (NeRF) by predicting important samples that contribute much to the generated image. However, OG manages occupied regions in the form of a dense binary grid, in which there are many blocks with the same values that cause redundant examination of voxels' emptiness in ray-tracing…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Short paper for CVPR Neural Rendering Intelligence Workshop 2024. Code: https://github.com/Yosshi999/faster-occgrid

  3. arXiv:2311.05237  [pdf, other]

    cs.CV

    Widely Applicable Strong Baseline for Sports Ball Detection and Tracking

    Authors: Shuhei Tarashima, Muhammad Abdul Haq, Yushan Wang, Norio Tagawa

    Abstract: In this work, we present a novel Sports Ball Detection and Tracking (SBDT) method that can be applied to various sports categories. Our approach is composed of (1) high-resolution feature extraction, (2) position-aware model training, and (3) inference considering temporal consistency, all of which are put together as a new SBDT baseline. Besides, to validate the wide applicability of our approach…

    Submitted 16 November, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: BMVC 2023. Code & dataset: https://github.com/nttcom/WASB-SBDT

  4. arXiv:2309.04675  [pdf, other]

    cs.CV

    BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification

    Authors: Takuro Fujii, Shuhei Tarashima

    Abstract: Text-based person re-identification (TBPReID) aims to retrieve person images represented by a given textual query. In this task, how to effectively align images and texts globally and locally is a crucial challenge. Recent works have obtained high performance by solving Masked Language Modeling (MLM) to align image/text parts. However, they only performed uni-directional (i.e., from image to text…

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted at ICCVW 2023