Showing 1–12 of 12 results for author: Yamaguchi, M

Searching in archive cs.
  1. arXiv:2507.06261  [pdf, ps, other]

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu, et al. (3264 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde…

    Submitted 11 July, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  2. arXiv:2505.19500  [pdf, ps, other]

    cs.CV

    Objective, Absolute and Hue-aware Metrics for Intrinsic Image Decomposition on Real-World Scenes: A Proof of Concept

    Authors: Shogo Sato, Masaru Tsuchida, Mariko Yamaguchi, Takuhiro Kaneko, Kazuhiko Murasaki, Taiga Yoshida, Ryuichi Tanida

    Abstract: Intrinsic image decomposition (IID) is the task of separating an image into albedo and shade. In real-world scenes, it is difficult to quantitatively assess IID quality due to the unavailability of ground truth. The existing method provides the relative reflection intensities based on human-judged annotations. However, these annotations have challenges in subjectivity, relative evaluation, and hue…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: copyright 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  3. Controlling the color appearance of objects by optimizing the illumination spectrum

    Authors: Mariko Yamaguchi, Masaru Tsuchida, Takahiro Matsumoto, Tetsuro Tokunaga, Takayoshi Mochizuki

    Abstract: We have developed an innovative lighting system that changes specific target colors while keeping the lights appearing naturally white. By precisely controlling the spectral power distribution (SPD) of illumination and harnessing the unique phenomenon of metamerism, our system achieves unique color variations in ways you've never seen before. Our system calculates the optimal SPDs of illumination…

    Submitted 17 June, 2024; originally announced July 2024.

    ACM Class: J.5

    Journal ref: SIGGRAPH Emerging Technologies (2024)

  4. arXiv:2303.17750  [pdf, other]

    cs.CL

    Design by Contract Framework for Quantum Software

    Authors: Masaomi Yamaguchi, Nobukazu Yoshioka

    Abstract: To realize reliable quantum software, techniques to automatically ensure the quantum software's correctness have recently been investigated. However, they primarily focus on fixed quantum circuits rather than the procedure of building quantum circuits. Despite being a common approach, the correctness of building circuits using different parameters following the same procedure is not guaranteed. To…

    Submitted 30 March, 2023; originally announced March 2023.

  5. arXiv:2108.13783  [pdf, other]

    cs.PL

    Synbit: Synthesizing Bidirectional Programs using Unidirectional Sketches

    Authors: Masaomi Yamaguchi, Kazutaka Matsuda, Cristina David, Meng Wang

    Abstract: We propose a technique for synthesizing bidirectional programs from the corresponding unidirectional code plus a few input/output examples. The core ideas are: (1) constructing a sketch using the given unidirectional program as a specification, and (2) filling the sketch in a modular fashion by exploiting the properties of bidirectional programs. These ideas are enabled by our choice of programmin…

    Submitted 6 October, 2021; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: This is the full version of a paper "Synbit: Synthesizing Bidirectional Programs using Unidirectional Sketches" accepted by OOPSLA 2021. The accepted paper is available at https://doi.org/10.1145/3485482

  6. arXiv:2108.12165  [pdf, ps, other]

    cs.LG cs.CV

    LassoLayer: Nonlinear Feature Selection by Switching One-to-one Links

    Authors: Akihito Sudo, Teng Teck Hou, Masaki Yamaguchi, Yoshinori Tone

    Abstract: Along with the desire to address more complex problems, feature selection methods have gained in importance. Feature selection methods can be classified into wrapper method, filter method, and embedded method. Being a powerful embedded feature selection method, Lasso has attracted the attention of many researchers. However, as a linear approach, the applicability of Lasso has been limited. In this…

    Submitted 27 August, 2021; originally announced August 2021.

  7. arXiv:1907.08338  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Batch Uniformization for Minimizing Maximum Anomaly Score of DNN-based Anomaly Detection in Sounds

    Authors: Yuma Koizumi, Shoichiro Saito, Masataka Yamaguchi, Shin Murata, Noboru Harada

    Abstract: Use of an autoencoder (AE) as a normal model is a state-of-the-art technique for unsupervised-anomaly detection in sounds (ADS). The AE is trained to minimize the sample mean of the anomaly score of normal sounds in a mini-batch. One problem with this approach is that the anomaly score of rare-normal sounds becomes higher than that of frequent-normal sounds, because the sample mean is strongly aff…

    Submitted 18 July, 2019; originally announced July 2019.

    Comments: 5 pages, to appear in IEEE WASPAA 2019

  8. arXiv:1902.07707  [pdf, ps, other]

    cs.ET

    An Energy-efficient Time-domain Analog VLSI Neural Network Processor Based on a Pulse-width Modulation Approach

    Authors: Masatoshi Yamaguchi, Goki Iwamoto, Hakaru Tamukoh, Takashi Morie

    Abstract: A time-domain analog-weighted-sum calculation model based on a pulse-width modulation (PWM) approach is proposed. The proposed calculation model can be applied to any types of network structure including multi-layer feedforward networks. We also propose very large-scale integrated (VLSI) circuits to implement the proposed model. Unlike the conventional analog voltage or current mode circuits used…

    Submitted 16 February, 2019; originally announced February 2019.

    Comments: arXiv admin note: text overlap with arXiv:1810.06819

  9. arXiv:1812.05796  [pdf, other]

    stat.ML cs.LG cs.SD eess.AS

    AdaFlow: Domain-Adaptive Density Estimator with Application to Anomaly Detection and Unpaired Cross-Domain Translation

    Authors: Masataka Yamaguchi, Yuma Koizumi, Noboru Harada

    Abstract: We tackle unsupervised anomaly detection (UAD), a problem of detecting data that significantly differ from normal data. UAD is typically solved by using density estimation. Recently, deep neural network (DNN)-based density estimators, such as Normalizing Flows, have been attracting attention. However, one of their drawbacks is the difficulty in adapting them to the change in the normal data's dist…

    Submitted 13 March, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: Accepted to ICASSP2019

  10. arXiv:1710.11549  [pdf, other]

    cs.SD cs.MM eess.AS

    Melody Generation for Pop Music via Word Representation of Musical Properties

    Authors: Andrew Shin, Leopold Crestel, Hiroharu Kato, Kuniaki Saito, Katsunori Ohnishi, Masataka Yamaguchi, Masahiro Nakawaki, Yoshitaka Ushiku, Tatsuya Harada

    Abstract: Automatic melody generation for pop music has been a long-time aspiration for both AI researchers and musicians. However, learning to generate euphonious melody has turned out to be highly challenging due to a number of factors. Representation of multivariate property of notes has been one of the primary challenges. It is also difficult to remain in the permissible spectrum of musical variety, out…

    Submitted 31 October, 2017; originally announced October 2017.

    Comments: submitted to ICLR 2018

  11. arXiv:1704.07945  [pdf, ps, other]

    cs.CV

    Spatio-temporal Person Retrieval via Natural Language Queries

    Authors: Masataka Yamaguchi, Kuniaki Saito, Yoshitaka Ushiku, Tatsuya Harada

    Abstract: In this paper, we address the problem of spatio-temporal person retrieval from multiple videos using a natural language query, in which we output a tube (i.e., a sequence of bounding boxes) which encloses the person described by the query. For this problem, we introduce a novel dataset consisting of videos containing people annotated with bounding boxes for each second and with five natural langua…

    Submitted 22 August, 2017; v1 submitted 25 April, 2017; originally announced April 2017.

    Comments: Accepted to ICCV2017

  12. arXiv:1603.09046  [pdf, other]

    cs.CV

    Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning

    Authors: Andrew Shin, Masataka Yamaguchi, Katsunori Ohnishi, Tatsuya Harada

    Abstract: The workflow of extracting features from images using convolutional neural networks (CNN) and generating captions with recurrent neural networks (RNN) has become a de-facto standard for image captioning task. However, since CNN features are originally designed for classification task, it is mostly concerned with the main conspicuous element of the image, and often fails to correctly convey informa…

    Submitted 30 March, 2016; originally announced March 2016.

    Comments: submitted to ECCV2016