Showing 1–2 of 2 results for author: Kageyama, Y

Search v0.5.6 released 2020-02-24

arXiv:2309.01984 [pdf, other]

physics.optics cs.GR

Focal Surface Projection: Extending Projector Depth-of-Field Using a Phase-Only Spatial Light Modulator

Authors: Fumitaka Ueda, Yuta Kageyama, Daisuke Iwai, Kosuke Sato

Abstract: We present a focal surface projection to solve the narrow depth-of-field problem in projection mapping applications. We apply a phase-only spatial light modulator to realize nonuniform focusing distances, whereby the projected contents appear focused on a surface with considerable depth variations. The feasibility of the proposed technique was validated through a physical experiment. We present a focal surface projection to solve the narrow depth-of-field problem in projection mapping applications. We apply a phase-only spatial light modulator to realize nonuniform focusing distances, whereby the projected contents appear focused on a surface with considerable depth variations. The feasibility of the proposed technique was validated through a physical experiment. △ Less

Submitted 5 September, 2023; originally announced September 2023.
arXiv:1811.05233 [pdf]

cs.LG cs.CV

Massively Distributed SGD: ImageNet/ResNet-50 Training in a Flash

Authors: Hiroaki Mikami, Hisahiro Suganuma, Pongsakorn U-chupala, Yoshiki Tanaka, Yuichi Kageyama

Abstract: Scaling the distributed deep learning to a massive GPU cluster level is challenging due to the instability of the large mini-batch training and the overhead of the gradient synchronization. We address the instability of the large mini-batch training with batch-size control and label smoothing. We address the overhead of the gradient synchronization with 2D-Torus all-reduce. Specifically, 2D-Torus… ▽ More Scaling the distributed deep learning to a massive GPU cluster level is challenging due to the instability of the large mini-batch training and the overhead of the gradient synchronization. We address the instability of the large mini-batch training with batch-size control and label smoothing. We address the overhead of the gradient synchronization with 2D-Torus all-reduce. Specifically, 2D-Torus all-reduce arranges GPUs in a logical 2D grid and performs a series of collective operation in different orientations. These two techniques are implemented with Neural Network Libraries (NNL). We have successfully trained ImageNet/ResNet-50 in 122 seconds without significant accuracy loss on ABCI cluster. △ Less

Submitted 5 March, 2019; v1 submitted 13 November, 2018; originally announced November 2018.

Search v0.5.6 released 2020-02-24