Skip to main content

Showing 1–2 of 2 results for author: Hirayama, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.13802  [pdf, other

    cs.CL

    SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model

    Authors: Christopher Nguyen, William Nguyen, Atsushi Suzuki, Daisuke Oku, Hong An Phan, Sang Dinh, Zooey Nguyen, Anh Ha, Shruti Raghavan, Huy Vo, Thang Nguyen, Lan Nguyen, Yoshikuni Hirayama

    Abstract: Large Language Models (LLMs) have demonstrated the potential to address some issues within the semiconductor industry. However, they are often general-purpose models that lack the specialized knowledge needed to tackle the unique challenges of this sector, such as the intricate physics and chemistry of semiconductor devices and processes. SemiKong, the first industry-specific LLM for the semicondu… ▽ More

    Submitted 21 November, 2024; v1 submitted 20 November, 2024; originally announced November 2024.

    Comments: On-going work

  2. arXiv:2211.13402  [pdf, other

    cs.LG

    MP-GELU Bayesian Neural Networks: Moment Propagation by GELU Nonlinearity

    Authors: Yuki Hirayama, Sinya Takamaeda-Yamazaki

    Abstract: Bayesian neural networks (BNNs) have been an important framework in the study of uncertainty quantification. Deterministic variational inference, one of the inference methods, utilizes moment propagation to compute the predictive distributions and objective functions. Unfortunately, deriving the moments requires computationally expensive Taylor expansion in nonlinear functions, such as a rectified… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 9 pages, 1 figures