Skip to main content

Showing 1–5 of 5 results for author: Motomura, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.11730  [pdf, ps, other

    cs.AI cs.LG

    Rethinking Optimal Verification Granularity for Compute-Efficient Test-Time Scaling

    Authors: Hao Mark Chen, Guanxi Lu, Yasuyuki Okoshi, Zhiwen Mo, Masato Motomura, Hongxiang Fan

    Abstract: Test-time scaling (TTS) has proven effective in enhancing the reasoning capabilities of large language models (LLMs). Verification plays a key role in TTS, simultaneously influencing (1) reasoning performance and (2) compute efficiency, due to the quality and computational cost of verification. In this work, we challenge the conventional paradigms of verification, and make the first attempt toward… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Preprint. Under review

  2. arXiv:2504.08315  [pdf, other

    math.OC cs.NE

    Annealed Mean Field Descent Is Highly Effective for Quadratic Unconstrained Binary Optimization

    Authors: Kyo Kuroki, Thiem Van Chu, Masato Motomura, Kazushi Kawamura

    Abstract: In recent years, formulating various combinatorial optimization problems as Quadratic Unconstrained Binary Optimization (QUBO) has gained significant attention as a promising approach for efficiently obtaining optimal or near-optimal solutions. While QUBO offers a general-purpose framework, existing solvers often struggle with performance variability across different problems. This paper (i) the… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

  3. arXiv:2402.14029  [pdf, other

    cs.LG cs.AI stat.ML

    Partially Frozen Random Networks Contain Compact Strong Lottery Tickets

    Authors: Hikari Otsuka, Daiki Chijiwa, Ángel López García-Arias, Yasuyuki Okoshi, Kazushi Kawamura, Thiem Van Chu, Daichi Fujiki, Susumu Takeuchi, Masato Motomura

    Abstract: Randomly initialized dense networks contain subnetworks that achieve high accuracy without weight learning--strong lottery tickets (SLTs). Recently, Gadhikar et al. (2023) demonstrated that SLTs could also be found within a randomly pruned source network. This phenomenon can be exploited to further compress the small memory size required by SLTs. However, their method is limited to SLTs that are e… ▽ More

    Submitted 8 February, 2025; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted at TMLR

  4. arXiv:2312.03236  [pdf, other

    cs.LG cs.AI stat.ML

    Multicoated and Folded Graph Neural Networks with Strong Lottery Tickets

    Authors: Jiale Yan, Hiroaki Ito, Ángel López García-Arias, Yasuyuki Okoshi, Hikari Otsuka, Kazushi Kawamura, Thiem Van Chu, Masato Motomura

    Abstract: The Strong Lottery Ticket Hypothesis (SLTH) demonstrates the existence of high-performing subnetworks within a randomly initialized model, discoverable through pruning a convolutional neural network (CNN) without any weight training. A recent study, called Untrained GNNs Tickets (UGT), expanded SLTH from CNNs to shallow graph neural networks (GNNs). However, discrepancies persist when comparing ba… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 9 pages, accepted in the Second Learning on Graphs Conference (LoG 2023)

    Journal ref: Proceedings of the Second Learning on Graphs Conference (LoG 2023), PMLR 231

  5. arXiv:2111.12330  [pdf, other

    cs.CV

    Hidden-Fold Networks: Random Recurrent Residuals Using Sparse Supermasks

    Authors: Ángel López García-Arias, Masanori Hashimoto, Masato Motomura, Jaehoon Yu

    Abstract: Deep neural networks (DNNs) are so over-parametrized that recent research has found them to already contain a subnetwork with high accuracy at their randomly initialized state. Finding these subnetworks is a viable alternative training method to weight learning. In parallel, another line of work has hypothesized that deep residual networks (ResNets) are trying to approximate the behaviour of shall… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 13 pages, 7 figures. Accepted to the British Machine Vision Conference (BMVC) 2021