Showing 1–21 of 21 results for author: Ikehata, S

Searching in archive cs.
  1. arXiv:2509.20281  [pdf, ps, other]

    cs.CV

    PerFace: Metric Learning in Perceptual Facial Similarity for Enhanced Face Anonymization

    Authors: Haruka Kumagai, Leslie Wöhler, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: In response to rising societal awareness of privacy concerns, face anonymization techniques have advanced, including the emergence of face-swapping methods that replace one identity with another. Achieving a balance between anonymity and naturalness in face swapping requires careful selection of identities: overly similar faces compromise anonymity, while dissimilar ones reduce naturalness. Existi…

    Submitted 24 September, 2025; originally announced September 2025.

  2. arXiv:2507.19292  [pdf, ps, other]

    cs.CV

    PINO: Person-Interaction Noise Optimization for Long-Duration and Customizable Motion Generation of Arbitrary-Sized Groups

    Authors: Sakuya Ota, Qing Yu, Kent Fujiwara, Satoshi Ikehata, Ikuro Sato

    Abstract: Generating realistic group interactions involving multiple characters remains challenging due to increasing complexity as group size expands. While existing conditional diffusion models incrementally generate motions by conditioning on previously generated characters, they rely on single shared prompts, limiting nuanced control and leading to overly simplified interactions. In this paper, we intro…

    Submitted 25 July, 2025; originally announced July 2025.

    Comments: Accepted to ICCV 2025, Project page: https://sinc865.github.io/pino/

  3. arXiv:2506.18882  [pdf, ps, other]

    cs.CV

    Light of Normals: Unified Feature Representation for Universal Photometric Stereo

    Authors: Hong Li, Houyuan Chen, Chongjie Ye, Zhaoxi Chen, Bohan Li, Shaocong Xu, Xianda Guo, Xuhui Liu, Yikai Wang, Baochang Zhang, Satoshi Ikehata, Boxin Shi, Anyi Rao, Hao Zhao

    Abstract: Universal photometric stereo (PS) is defined by two factors: it must (i) operate under arbitrary, unknown lighting conditions and (ii) avoid reliance on specific illumination models. Despite progress (e.g., SDM UniPS), two challenges remain. First, current encoders cannot guarantee that illumination and normal information are decoupled. To enforce decoupling, we introduce LINO UniPS with two key c…

    Submitted 27 September, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

    Comments: Home: https://houyuanchen111.github.io/lino.github.io Github: https://github.com/houyuanchen111/LINO_UniPS HuggingFace Demo: https://huggingface.co/spaces/houyuanchen/lino

  4. arXiv:2503.18341  [pdf, other]

    cs.CV

    PS-EIP: Robust Photometric Stereo Based on Event Interval Profile

    Authors: Kazuma Kitazawa, Takahito Aoto, Satoshi Ikehata, Tsuyoshi Takatani

    Abstract: Recently, the energy-efficient photometric stereo method using an event camera has been proposed to recover surface normals from events triggered by changes in logarithmic Lambertian reflections under a moving directional light source. However, EventPS treats each event interval independently, making it sensitive to noise, shadows, and non-Lambertian reflections. This paper proposes Photometric St…

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: CVPR2025

  5. arXiv:2502.14003  [pdf, other]

    cs.LG cs.AI

    Rectified Lagrangian for Out-of-Distribution Detection in Modern Hopfield Networks

    Authors: Ryo Moriai, Nakamasa Inoue, Masayuki Tanaka, Rei Kawakami, Satoshi Ikehata, Ikuro Sato

    Abstract: Modern Hopfield networks (MHNs) have recently gained significant attention in the field of artificial intelligence because they can store and retrieve a large set of patterns with an exponentially large memory capacity. A MHN is generally a dynamical system defined with Lagrangians of memory and feature neurons, where memories associated with in-distribution (ID) samples are represented by attract…

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: Accepted to AAAI 2025

  6. arXiv:2410.20716  [pdf, other]

    cs.CV

    Physics-Free Spectrally Multiplexed Photometric Stereo under Unknown Spectral Composition

    Authors: Satoshi Ikehata, Yuta Asano

    Abstract: In this paper, we present a groundbreaking spectrally multiplexed photometric stereo approach for recovering surface normals of dynamic surfaces without the need for calibrated lighting or sensors, a notable advancement in the field traditionally hindered by stringent prerequisites and spectral ambiguity. By embracing spectral ambiguity as an advantage, our technique enables the generation of trai…

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: ECCV2024 (Oral)

  7. arXiv:2410.20306  [pdf, other]

    cs.CV

    GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields

    Authors: Yusuke Sekikawa, Chingwei Hsu, Satoshi Ikehata, Rei Kawakami, Ikuro Sato

    Abstract: We propose Gumbel-NeRF, a mixture-of-expert (MoE) neural radiance fields (NeRF) model with a hindsight expert selection mechanism for synthesizing novel views of unseen objects. Previous studies have shown that the MoE structure provides high-quality representations of a given large-scale scene consisting of many objects. However, we observe that such a MoE NeRF model often produces low-quality re…

    Submitted 26 October, 2024; originally announced October 2024.

    Comments: 7 pages. Presented at ICIP2024

  8. arXiv:2409.00674  [pdf, other]

    cs.CV

    MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo

    Authors: Ashish Tiwari, Satoshi Ikehata, Shanmuganathan Raman

    Abstract: Photometric stereo typically demands intricate data acquisition setups involving multiple light sources to recover surface normals accurately. In this paper, we propose MERLiN, an attention-based hourglass network that integrates single image-based inverse rendering and relighting within a single unified framework. We evaluate the performance of photometric stereo methods using these relit images…

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: Accepted in ECCV 2024

  9. arXiv:2408.04844  [pdf, other]

    cs.HC

    Investigating the Perception of Facial Anonymization Techniques in 360° Videos

    Authors: Leslie Wöhler, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: In this work, we investigate facial anonymization techniques in 360° videos and assess their influence on the perceived realism, anonymization effect, and presence of participants. In comparison to traditional footage, 360° videos can convey engaging, immersive experiences that accurately represent the atmosphere of real-world locations. As the entire environment is captured simultaneously, it is…

    Submitted 8 August, 2024; originally announced August 2024.

  10. arXiv:2405.05924  [pdf]

    cs.HC

    Privacy Protection and Video Manipulation in Immersive Media

    Authors: Leslie Wöhler, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: In comparison to traditional footage, 360° videos can convey engaging, immersive experiences and even be utilized to create interactive virtual environments. Like regular recordings, these videos need to consider the privacy of recorded people and could be targets for video manipulations. However, due to their properties like enhanced presence, the effects on users might differ from traditional, n…

    Submitted 23 April, 2024; originally announced May 2024.

    Comments: This is an accepted position statement of CHI 2024 Workshop (Novel Approaches for Understanding and Mitigating Emerging New Harms in Immersive and Embodied Virtual Spaces: A Workshop at CHI 2024)

  11. arXiv:2403.16141  [pdf, other]

    cs.CV

    Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes

    Authors: Takashi Otonari, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: Recent advancements in the study of Neural Radiance Fields (NeRF) for dynamic scenes often involve explicit modeling of scene dynamics. However, this approach faces challenges in modeling scene dynamics in urban environments, where moving objects of various categories and scales are present. In such settings, it becomes crucial to effectively eliminate moving objects to accurately reconstruct stat…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), Project website: https://otonari726.github.io/entitynerf/

  12. arXiv:2303.15724  [pdf, other]

    cs.CV cs.GR

    Scalable, Detailed and Mask-Free Universal Photometric Stereo

    Authors: Satoshi Ikehata

    Abstract: In this paper, we introduce SDM-UniPS, a groundbreaking Scalable, Detailed, Mask-free, and Universal Photometric Stereo network. Our approach can recover astonishingly intricate surface normal maps, rivaling the quality of 3D scanners, even when images are captured under unknown, spatially-varying lighting conditions in uncontrolled environments. We have extended previous universal photometric ste…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 (Highlight). The source code will be available at https://github.com/satoshi-ikehata/SDM-UniPS-CVPR2023

  13. arXiv:2212.03635  [pdf, other]

    cs.CV cs.GR

    Non-uniform Sampling Strategies for NeRF on 360° images

    Authors: Takashi Otonari, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: In recent years, the performance of novel view synthesis using perspective images has dramatically improved with the advent of neural radiance fields (NeRF). This study proposes two novel techniques that effectively build NeRF for 360° omnidirectional images. Due to the characteristics of a 360° image of ERP format that has spatial distortion in their high latitude regions…

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: Accepted at the 33rd British Machine Vision Conference (BMVC) 2022

  14. arXiv:2211.11386  [pdf, other]

    cs.CV

    PS-Transformer: Learning Sparse Photometric Stereo Network using Self-Attention Mechanism

    Authors: Satoshi Ikehata

    Abstract: Existing deep calibrated photometric stereo networks basically aggregate observations under different lights based on the pre-defined operations such as linear projection and max pooling. While they are effective with the dense capture, simple first-order operations often fail to capture the high-order interactions among observations under small number of different lights. To tackle this issue, th…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: BMVC2021. Code and Supplementary are available at https://github.com/satoshi-ikehata/PS-Transformer-BMVC2021

    Journal ref: BMVC. Vol. 2. No. 4. 2021

  15. Saliency-based Multiple Region of Interest Detection from a Single 360° image

    Authors: Yuuki Sawabe, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: 360° images are informative -- it contains omnidirectional visual information around the camera. However, the areas that cover a 360° image is much larger than the human's field of view, therefore important information in different view directions is easily overlooked. To tackle this issue, we propose a method for predicting the optimal set of Region of Interest (RoI) from a single 360° image usin…

    Submitted 8 September, 2022; originally announced September 2022.

    Journal ref: in IEEE Access, vol. 10, pp. 89124-89133, 2022

  16. arXiv:2206.02452  [pdf, other]

    cs.CV eess.IV

    Universal Photometric Stereo Network using Global Lighting Contexts

    Authors: Satoshi Ikehata

    Abstract: This paper tackles a new photometric stereo task, named universal photometric stereo. Unlike existing tasks that assumed specific physical lighting models; hence, drastically limited their usability, a solution algorithm of this task is supposed to work for objects with diverse shapes and materials under arbitrary lighting variations without assuming any specific models. To solve this extremely ch…

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted to CVPR2022. Code and Dataset at https://satoshi-ikehata.github.io/cvpr2022/univps_cvpr2022.html

  17. arXiv:2204.04634  [pdf, other]

    cs.CV cs.MM

    Intersection Prediction from Single 360° Image via Deep Detection of Possible Direction of Travel

    Authors: Naoki Sugimoto, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: Movie-Map, an interactive first-person-view map that engages the user in a simulated walking experience, comprises short 360° video segments separated by traffic intersections that are seamlessly connected according to the viewer's direction of travel. However, in wide urban-scale areas with numerous intersecting roads, manual intersection segmentation requires significant human effort. Therefore,…

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: Accepted for publication in BMVC

  18. arXiv:2202.03176  [pdf, other]

    cs.CV

    Field-of-View IoU for Object Detection in 360° Images

    Authors: Miao Cao, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: 360° cameras have gained popularity over the last few years. In this paper, we propose two fundamental techniques -- Field-of-View IoU (FoV-IoU) and 360Augmentation for object detection in 360° images. Although most object detection neural networks designed for the perspective images are applicable to 360° images in equirectangular projection (ERP) format, their performance deteriorates owing to t…

    Submitted 22 September, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

  19. arXiv:1808.10093  [pdf, other]

    cs.CV

    CNN-PS: CNN-based Photometric Stereo for General Non-Convex Surfaces

    Authors: Satoshi Ikehata

    Abstract: Most conventional photometric stereo algorithms inversely solve a BRDF-based image formation model. However, the actual imaging process is often far more complex due to the global light transport on the non-convex surfaces. This paper presents a photometric stereo network that directly learns relationships between the photometric stereo input and surface normals of a scene. For handling unordered,…

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: Accepted in ECCV 2018 (ECCV2018). Source code and supplementary are available at https://github.com/satoshi-ikehata/CNN-PS

  20. arXiv:1808.08544  [pdf, other]

    cs.CV

    Scale Drift Correction of Camera Geo-Localization using Geo-Tagged Images

    Authors: Kazuya Iwami, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: Camera geo-localization from a monocular video is a fundamental task for video analysis and autonomous navigation. Although 3D reconstruction is a key technique to obtain camera poses, monocular 3D reconstruction in a large environment tends to result in the accumulation of errors in rotation, translation, and especially in scale: a problem known as scale drift. To overcome these errors, we propos…

    Submitted 26 August, 2018; originally announced August 2018.

    Comments: ECCV Workshop CVRSUAD

  21. arXiv:1612.01256  [pdf, other]

    cs.CV

    Panoramic Structure from Motion via Geometric Relationship Detection

    Authors: Satoshi Ikehata, Ivaylo Boyadzhiev, Qi Shan, Yasutaka Furukawa

    Abstract: This paper addresses the problem of Structure from Motion (SfM) for indoor panoramic image streams, extremely challenging even for the state-of-the-art due to the lack of textures and minimal parallax. The key idea is the fusion of single-view and multi-view reconstruction techniques via geometric relationship detection (e.g., detecting 2D lines as coplanar in 3D). Rough geometry suffices to perfo…

    Submitted 5 December, 2016; originally announced December 2016.