Skip to main content

Showing 1–4 of 4 results for author: Effendi, J

.
  1. arXiv:2403.15484  [pdf, other

    cs.CL cs.LG

    RakutenAI-7B: Extending Large Language Models for Japanese

    Authors: Rakuten Group, Aaron Levine, Connie Huang, Chenguang Wang, Eduardo Batista, Ewa Szymanska, Hongyi Ding, Hou Wei Chou, Jean-François Pessiot, Johanes Effendi, Justin Chiu, Kai Torben Ohlhus, Karan Chopra, Keiji Shinzato, Koji Murakami, Lee Xiong, Lei Chen, Maki Kubota, Maksim Tkachenko, Miroku Lee, Naoki Takahashi, Prathyusha Jwalapuram, Ryutaro Tatsushima, Saurabh Jain, Sunil Kumar Yadav , et al. (5 additional authors not shown)

    Abstract: We introduce RakutenAI-7B, a suite of Japanese-oriented large language models that achieve the best performance on the Japanese LM Harness benchmarks among the open 7B models. Along with the foundation model, we release instruction- and chat-tuned models, RakutenAI-7B-instruct and RakutenAI-7B-chat respectively, under the Apache 2.0 license.

    Submitted 21 March, 2024; originally announced March 2024.

  2. arXiv:2011.02099  [pdf, other

    cs.CL cs.SD eess.AS

    Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework

    Authors: Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

    Abstract: Previous research has proposed a machine speech chain to enable automatic speech recognition (ASR) and text-to-speech synthesis (TTS) to assist each other in semi-supervised learning and to avoid the need for a large amount of paired speech and text data. However, that framework still requires a large amount of unpaired (speech or text) data. A prototype multimodal machine chain was then explored… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: Accepted at INTERSPEECH 2020

  3. arXiv:1906.00579  [pdf, other

    cs.CL cs.SD eess.AS

    Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain

    Authors: Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

    Abstract: Previously, a machine speech chain, which is based on sequence-to-sequence deep learning, was proposed to mimic speech perception and production behavior. Such chains separately processed listening and speaking by automatic speech recognition (ASR) and text-to-speech synthesis (TTS) and simultaneously enabled them to teach each other in semi-supervised learning when they received unpaired data. Un… ▽ More

    Submitted 14 November, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: Accepted in IEEE ASRU 2019

  4. arXiv:1701.08744  [pdf

    cs.IR cs.AI cs.LG

    Click Through Rate Prediction for Contextual Advertisment Using Linear Regression

    Authors: Muhammad Junaid Effendi, Syed Abbas Ali

    Abstract: This research presents an innovative and unique way of solving the advertisement prediction problem which is considered as a learning problem over the past several years. Online advertising is a multi-billion-dollar industry and is growing every year with a rapid pace. The goal of this research is to enhance click through rate of the contextual advertisements using Linear Regression. In order to a… ▽ More

    Submitted 30 January, 2017; originally announced January 2017.

    Comments: 8 pages, 13 Figures, 11 Tables