Skip to main content

Showing 1–3 of 3 results for author: Imaizumi, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.23544  [pdf, ps, other

    cs.LG math.OC

    Both Asymptotic and Non-Asymptotic Convergence of Quasi-Hyperbolic Momentum using Increasing Batch Size

    Authors: Kento Imaizumi, Hideaki Iiduka

    Abstract: Momentum methods were originally introduced for their superiority to stochastic gradient descent (SGD) in deterministic settings with convex objective functions. However, despite their widespread application to deep neural networks -- a representative case of stochastic nonconvex optimization -- the theoretical justification for their effectiveness in such settings remains limited. Quasi-hyperboli… ▽ More

    Submitted 1 July, 2025; v1 submitted 30 June, 2025; originally announced June 2025.

  2. arXiv:2403.18151  [pdf

    eess.IV cs.CV physics.med-ph

    Automated Report Generation for Lung Cytological Images Using a CNN Vision Classifier and Multiple-Transformer Text Decoders: Preliminary Study

    Authors: Atsushi Teramoto, Ayano Michiba, Yuka Kiriyama, Tetsuya Tsukamoto, Kazuyoshi Imaizumi, Hiroshi Fujita

    Abstract: Cytology plays a crucial role in lung cancer diagnosis. Pulmonary cytology involves cell morphological characterization in the specimen and reporting the corresponding findings, which are extremely burdensome tasks. In this study, we propose a report-generation technique for lung cytology images. In total, 71 benign and 135 malignant pulmonary cytology specimens were collected. Patch images were e… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  3. arXiv:2402.15344  [pdf, other

    stat.ML cs.LG

    Iteration and Stochastic First-order Oracle Complexities of Stochastic Gradient Descent using Constant and Decaying Learning Rates

    Authors: Kento Imaizumi, Hideaki Iiduka

    Abstract: The performance of stochastic gradient descent (SGD), which is the simplest first-order optimizer for training deep neural networks, depends on not only the learning rate but also the batch size. They both affect the number of iterations and the stochastic first-order oracle (SFO) complexity needed for training. In particular, the previous numerical results indicated that, for SGD using a constant… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: The latest version was updated on Feb. 23. arXiv admin note: text overlap with arXiv:2307.13831