Skip to main content

Showing 1–50 of 62 results for author: Nakamura, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.03897  [pdf, ps, other

    cs.LG stat.ME stat.ML

    GenAI-Powered Inference

    Authors: Kosuke Imai, Kentaro Nakamura

    Abstract: We introduce GenAI-Powered Inference (GPI), a statistical framework for both causal and predictive inference using unstructured data, including text and images. GPI leverages open-source Generative Artificial Intelligence (GenAI) models - such as large language models and diffusion models - not only to generate unstructured data at scale but also to extract low-dimensional representations that cap… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

  2. arXiv:2505.12746  [pdf, ps, other

    cs.AI

    Correspondence of high-dimensional emotion structures elicited by video clips between humans and Multimodal LLMs

    Authors: Haruka Asanuma, Naoko Koide-Majima, Ken Nakamura, Takato Horii, Shinji Nishimoto, Masafumi Oizumi

    Abstract: Recent studies have revealed that human emotions exhibit a high-dimensional, complex structure. A full capturing of this complexity requires new approaches, as conventional models that disregard high dimensionality risk overlooking key nuances of human emotions. Here, we examined the extent to which the latest generation of rapidly evolving Multimodal Large Language Models (MLLMs) capture these hi… ▽ More

    Submitted 23 May, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

    Comments: 25 pages, 7 figures

    ACM Class: I.2.7; I.2.10; I.5.1

  3. arXiv:2505.00779  [pdf, other

    cs.RO cs.LG eess.SY

    Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures

    Authors: Junwon Seo, Kensuke Nakamura, Andrea Bajcsy

    Abstract: Recent advances in generative world models have enabled classical safe control methods, such as Hamilton-Jacobi (HJ) reachability, to generalize to complex robotic systems operating directly from high-dimensional sensor observations. However, obtaining comprehensive coverage of all safety-critical scenarios during world model training is extremely challenging. As a result, latent safety filters bu… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

  4. arXiv:2504.01348  [pdf, other

    cs.CV cs.IR

    Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval

    Authors: Yuji Nozawa, Yu-Chieh Lin, Kazumoto Nakamura, Youyang Ng

    Abstract: The goal of this paper is to enhance pretrained Vision Transformer (ViT) models for focus-oriented image retrieval with visual prompting. In real-world image retrieval scenarios, both query and database images often exhibit complexity, with multiple objects and intricate backgrounds. Users often want to retrieve images with specific object, which we define as the Focus-Oriented Image Retrieval (FO… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: Accepted to CVPR 2025 PixFoundation Workshop

  5. arXiv:2503.00871  [pdf, other

    cs.LG cs.AI cs.CR

    CyberCScope: Mining Skewed Tensor Streams and Online Anomaly Detection in Cybersecurity Systems

    Authors: Kota Nakamura, Koki Kawabata, Shungo Tanaka, Yasuko Matsubara, Yasushi Sakurai

    Abstract: Cybersecurity systems are continuously producing a huge number of time-stamped events in the form of high-order tensors, such as {count; time, port, flow duration, packet size, . . . }, and so how can we detect anomalies/intrusions in real time? How can we identify multiple types of intrusions and capture their characteristic behaviors? The tensor data consists of categorical and continuous attrib… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

    Comments: Accepted by WWW 2025 short research paper

  6. arXiv:2502.11989  [pdf, other

    cs.HC cs.AI cs.CV

    Characterizing Photorealism and Artifacts in Diffusion Model-Generated Images

    Authors: Negar Kamali, Karyn Nakamura, Aakriti Kumar, Angelos Chatzimparmpas, Jessica Hullman, Matthew Groh

    Abstract: Diffusion model-generated images can appear indistinguishable from authentic photographs, but these images often contain artifacts and implausibilities that reveal their AI-generated provenance. Given the challenge to public trust in media posed by photorealistic AI-generated images, we conducted a large-scale experiment measuring human detection accuracy on 450 diffusion-model generated images an… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: 26 pages, 24 Figures, Accepted by ACM CHI 2025

  7. arXiv:2502.03702  [pdf, other

    cs.DS

    Tensor Decomposition Meets Knowledge Compilation: A Study Comparing Tensor Trains with OBDDs

    Authors: Ryoma Onaka, Kengo Nakamura, Masaaki Nishino, Norihito Yasuda

    Abstract: A knowledge compilation map analyzes tractable operations in Boolean function representations and compares their succinctness. This enables the selection of appropriate representations for different applications. In the knowledge compilation map, all representation classes are subsets of the negation normal form (NNF). However, Boolean functions may be better expressed by a representation that is… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

  8. arXiv:2502.00935  [pdf, other

    cs.RO cs.LG

    Generalizing Safety Beyond Collision-Avoidance via Latent-Space Reachability Analysis

    Authors: Kensuke Nakamura, Lasse Peters, Andrea Bajcsy

    Abstract: Hamilton-Jacobi (HJ) reachability is a rigorous mathematical framework that enables robots to simultaneously detect unsafe states and generate actions that prevent future failures. While in theory, HJ reachability can synthesize safe controllers for nonlinear systems and nonconvex constraints, in practice, it has been limited to hand-engineered collision-avoidance constraints modeled via low-dimen… ▽ More

    Submitted 30 April, 2025; v1 submitted 2 February, 2025; originally announced February 2025.

    Comments: 9 figures, 7 tables, RSS 2025

  9. arXiv:2411.05103  [pdf

    cs.HC

    MoHeat: A Modular Platform for High-Responsive Non-Contact Thermal Feedback Interactions

    Authors: Jiayi Xu, Kazuma Nakamura, Yoshihiro Kuroda, Masahiko Inami

    Abstract: MoHeat is a modular hardware and software platform designed for rapid prototyping of highly responsive, non-contact thermal feedback interactions. In our previous work, we developed an intensity-adjustable, highly responsive, non-contact thermal feedback system by integrating the vortex effect and thermal radiation. In this study, we further enhanced the system by developing an authoring tool that… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Part of proceedings of 6th International Conference AsiaHaptics 2024

  10. arXiv:2410.04801  [pdf, other

    cs.CV cs.LG

    Improving Image Clustering with Artifacts Attenuation via Inference-Time Attention Engineering

    Authors: Kazumoto Nakamura, Yuji Nozawa, Yu-Chieh Lin, Kengo Nakata, Youyang Ng

    Abstract: The goal of this paper is to improve the performance of pretrained Vision Transformer (ViT) models, particularly DINOv2, in image clustering task without requiring re-training or fine-tuning. As model size increases, high-norm artifacts anomaly appears in the patches of multi-head attention. We observe that this anomaly leads to reduced accuracy in zero-shot image clustering. These artifacts are c… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: Accepted to ACCV 2024

  11. arXiv:2410.00903  [pdf, ps, other

    stat.AP cs.CL cs.LG

    Causal Representation Learning with Generative Artificial Intelligence: Application to Texts as Treatments

    Authors: Kosuke Imai, Kentaro Nakamura

    Abstract: In this paper, we demonstrate how to enhance the validity of causal inference with unstructured high-dimensional treatments like texts, by leveraging the power of generative Artificial Intelligence (GenAI). Specifically, we propose to use a deep generative model such as large language models (LLMs) to efficiently generate treatments and use their internal representation for subsequent causal effec… ▽ More

    Submitted 2 July, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

  12. arXiv:2409.10924  [pdf, ps, other

    cs.IT

    Decoding Algorithm Correcting Single-Insertion Plus Single-Deletion for Non-binary Quantum Codes

    Authors: Ken Nakamura, Takayuki Nozaki

    Abstract: In this paper, we assume an error such that a single insertion occurs and then a single deletion occurs. Under such an error model, this paper provides a decoding algorithm for non-binary quantum codes constructed by Matsumoto and Hagiwara.

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: 6 pages, submitted to ISITA 2024

  13. ARIM-mdx Data System: Towards a Nationwide Data Platform for Materials Science

    Authors: Masatoshi Hanai, Ryo Ishikawa, Mitsuaki Kawamura, Masato Ohnishi, Norio Takenaka, Kou Nakamura, Daiju Matsumura, Seiji Fujikawa, Hiroki Sakamoto, Yukinori Ochiai, Tetsuo Okane, Shin-Ichiro Kuroki, Atsuo Yamada, Toyotaro Suzumura, Junichiro Shiomi, Kenjiro Taura, Yoshio Mita, Naoya Shibata, Yuichi Ikuhara

    Abstract: In modern materials science, effective and high-volume data management across leading-edge experimental facilities and world-class supercomputers is indispensable for cutting-edge research. However, existing integrated systems that handle data from these resources have primarily focused just on smaller-scale cross-institutional or single-domain operations. As a result, they often lack the scalabil… ▽ More

    Submitted 4 November, 2024; v1 submitted 8 September, 2024; originally announced September 2024.

    Comments: IEEE BigData 2024, to appear. Project Page https://arim.mdx.jp/

  14. arXiv:2407.04921  [pdf, other

    cs.CV cs.AI stat.AP

    Aortic root landmark localization with optimal transport loss for heatmap regression

    Authors: Tsuyoshi Ishizone, Masaki Miyasaka, Sae Ochi, Norio Tada, Kazuyuki Nakamura

    Abstract: Anatomical landmark localization is gaining attention to ease the burden on physicians. Focusing on aortic root landmark localization, the three hinge points of the aortic valve can reduce the burden by automatically determining the valve size required for transcatheter aortic valve implantation surgery. Existing methods for landmark prediction of the aortic root mainly use time-consuming two-step… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  15. arXiv:2406.08651  [pdf

    cs.HC cs.AI cs.CV

    How to Distinguish AI-Generated Images from Authentic Photographs

    Authors: Negar Kamali, Karyn Nakamura, Angelos Chatzimparmpas, Jessica Hullman, Matthew Groh

    Abstract: The high level of photorealism in state-of-the-art diffusion models like Midjourney, Stable Diffusion, and Firefly makes it difficult for untrained humans to distinguish between real photographs and AI-generated images. To address this problem, we designed a guide to help readers develop a more critical eye toward identifying artifacts, inconsistencies, and implausibilities that often appear in AI… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 54 pages, 189 Figures

  16. arXiv:2403.05074  [pdf, other

    cs.DS cs.CC

    Single Family Algebra Operation on BDDs and ZDDs Leads To Exponential Blow-Up

    Authors: Kengo Nakamura, Masaaki Nishino, Shuhei Denzumi

    Abstract: Binary decision diagram (BDD) and zero-suppressed binary decision diagram (ZDD) are data structures to represent a family of (sub)sets compactly, and it can be used as succinct indexes for a family of sets. To build BDD/ZDD representing a desired family of sets, there are many transformation operations that take BDDs/ZDDs as inputs and output BDD/ZDD representing the resultant family after perform… ▽ More

    Submitted 30 September, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: 17 pages, 4 figures; accepted for ISAAC 2024

  17. arXiv:2403.04745  [pdf, other

    cs.RO

    Not All Errors Are Made Equal: A Regret Metric for Detecting System-level Trajectory Prediction Failures

    Authors: Kensuke Nakamura, Ran Tian, Andrea Bajcsy

    Abstract: Robot decision-making increasingly relies on data-driven human prediction models when operating around people. While these models are known to mispredict in out-of-distribution interactions, only a subset of prediction errors impact downstream robot performance. We propose characterizing such "system-level" prediction failures via the mathematical notion of regret: high-regret interactions are pre… ▽ More

    Submitted 9 November, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 6 figures, 3 tables, Accepted to CoRL 2024

  18. Segmentation of Kidney Tumors on Non-Contrast CT Images using Protuberance Detection Network

    Authors: Taro Hatsutani, Akimichi Ichinose, Keigo Nakamura, Yoshiro Kitamura

    Abstract: Many renal cancers are incidentally found on non-contrast CT (NCCT) images. On contrast-enhanced CT (CECT) images, most kidney tumors, especially renal cancers, have different intensity values compared to normal tissues. However, on NCCT images, some tumors called isodensity tumors, have similar intensity values to the surrounding normal tissues, and can only be detected through a change in organ… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted in MICCAI 2023

    Journal ref: Medical Image Computing and Computer Assisted Intervention - MICCAI 2023. MICCAI 2023. Lecture Notes in Computer Science, vol 14226. Springer, Cham

  19. Visual Grounding of Whole Radiology Reports for 3D CT Images

    Authors: Akimichi Ichinose, Taro Hatsutani, Keigo Nakamura, Yoshiro Kitamura, Satoshi Iizuka, Edgar Simo-Serra, Shoji Kido, Noriyuki Tomiyama

    Abstract: Building a large-scale training dataset is an essential problem in the development of medical image recognition systems. Visual grounding techniques, which automatically associate objects in images with corresponding descriptions, can facilitate labeling of large number of images. However, visual grounding of radiology reports for CT images remains challenging, because so many kinds of anomalies a… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 14 pages, 7 figures. Accepted at MICCAI 2023

    Journal ref: Medical Image Computing and Computer Assisted Intervention Lecture Notes in Computer Science 14224 (2023) 611-621

  20. arXiv:2309.01267  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Deception Game: Closing the Safety-Learning Loop in Interactive Robot Autonomy

    Authors: Haimin Hu, Zixu Zhang, Kensuke Nakamura, Andrea Bajcsy, Jaime F. Fisac

    Abstract: An outstanding challenge for the widespread deployment of robotic systems like autonomous vehicles is ensuring safe interaction with humans without sacrificing performance. Existing safety methods often neglect the robot's ability to learn and adapt at runtime, leading to overly conservative behavior. This paper proposes a new closed-loop paradigm for synthesizing safe control policies that explic… ▽ More

    Submitted 1 November, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: Conference on Robot Learning 2023

  21. arXiv:2304.02687  [pdf, other

    eess.SY cs.RO

    Emergent Coordination through Game-Induced Nonlinear Opinion Dynamics

    Authors: Haimin Hu, Kensuke Nakamura, Kai-Chieh Hsu, Naomi Ehrich Leonard, Jaime Fernández Fisac

    Abstract: We present a multi-agent decision-making framework for the emergent coordination of autonomous agents whose intents are initially undecided. Dynamic non-cooperative games have been used to encode multi-agent interaction, but ambiguity arising from factors such as goal preference or the presence of multiple equilibria may lead to coordination issues, ranging from the "freezing robot" problem to uns… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  22. arXiv:2303.03789  [pdf, other

    cs.LG cs.AI cs.IT

    Fast and Multi-aspect Mining of Complex Time-stamped Event Streams

    Authors: Kota Nakamura, Yasuko Matsubara, Koki Kawabata, Yuhei Umeda, Yuichiro Wada, Yasushi Sakurai

    Abstract: Given a huge, online stream of time-evolving events with multiple attributes, such as online shopping logs: (item, price, brand, time), and local mobility activities: (pick-up and drop-off locations, time), how can we summarize large, dynamic high-order tensor streams? How can we see any hidden patterns, rules, and anomalies? Our answer is to focus on two types of patterns, i.e., ''regimes'' and '… ▽ More

    Submitted 5 July, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: Accepted by WWW 2023

  23. arXiv:2211.11222  [pdf, other

    eess.AS cs.CL cs.SD

    Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System

    Authors: Takenori Yoshimura, Shinji Takaki, Kazuhiro Nakamura, Keiichiro Oura, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda

    Abstract: This paper integrates a classic mel-cepstral synthesis filter into a modern neural speech synthesis system towards end-to-end controllable speech synthesis. Since the mel-cepstral synthesis filter is explicitly embedded in neural waveform models in the proposed system, both voice characteristics and the pitch of synthesized speech are highly controlled via a frequency warping parameter and fundame… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Submitted to ICASSP 2023

  24. arXiv:2210.05331  [pdf, other

    cs.LG

    Generalization Analysis on Learning with a Concurrent Verifier

    Authors: Masaaki Nishino, Kengo Nakamura, Norihito Yasuda

    Abstract: Machine learning technologies have been used in a wide range of practical systems. In practical situations, it is natural to expect the input-output pairs of a machine learning model to satisfy some requirements. However, it is difficult to obtain a model that satisfies requirements by just learning from examples. A simple solution is to add a module that checks whether the input-output pairs meet… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  25. arXiv:2210.01199  [pdf, other

    cs.RO

    Online Update of Safety Assurances Using Confidence-Based Predictions

    Authors: Kensuke Nakamura, Somil Bansal

    Abstract: Robots such as autonomous vehicles and assistive manipulators are increasingly operating in dynamic environments and close physical proximity to people. In such scenarios, the robot can leverage a human motion predictor to predict their future states and plan safe and efficient trajectories. However, no model is ever perfect -- when the observed human behavior deviates from the model predictions,… ▽ More

    Submitted 5 June, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 7 pages, 4 figures

  26. Parameter-Conditioned Reachable Sets for Updating Safety Assurances Online

    Authors: Javier Borquez, Kensuke Nakamura, Somil Bansal

    Abstract: Hamilton-Jacobi (HJ) reachability analysis is a powerful tool for analyzing the safety of autonomous systems. However, the provided safety assurances are often predicated on the assumption that once deployed, the system or its environment does not evolve. Online, however, an autonomous system might experience changes in system dynamics, control authority, external disturbances, and/or the surround… ▽ More

    Submitted 22 April, 2024; v1 submitted 29 September, 2022; originally announced September 2022.

  27. arXiv:2208.01800  [pdf, other

    cs.RO cs.MA

    Decentralized Learning With Limited Communications for Multi-robot Coverage of Unknown Spatial Fields

    Authors: Kensuke Nakamura, María Santos, Naomi Ehrich Leonard

    Abstract: This paper presents an algorithm for a team of mobile robots to simultaneously learn a spatial field over a domain and spatially distribute themselves to optimally cover it. Drawing from previous approaches that estimate the spatial field through a centralized Gaussian process, this work leverages the spatial structure of the coverage problem and presents a decentralized strategy where samples are… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: Accepted IROS 2022

  28. Class-Difficulty Based Methods for Long-Tailed Visual Recognition

    Authors: Saptarshi Sinha, Hiroki Ohashi, Katsuyuki Nakamura

    Abstract: Long-tailed datasets are very frequently encountered in real-world use cases where few classes or categories (known as majority or head classes) have higher number of data samples compared to the other classes (known as minority or tail classes). Training deep neural networks on such datasets gives results biased towards the head classes. So far, researchers have come up with multiple weighted los… ▽ More

    Submitted 22 August, 2022; v1 submitted 29 July, 2022; originally announced July 2022.

    Comments: Published in IJCV. Paper URL: https://rdcu.be/cTWem

    Journal ref: International Journal of Computer Vision (2022)

  29. arXiv:2206.09575  [pdf, other

    cs.CV cs.AI cs.LG

    C-SENN: Contrastive Self-Explaining Neural Network

    Authors: Yoshihide Sawada, Keigo Nakamura

    Abstract: In this study, we use a self-explaining neural network (SENN), which learns unsupervised concepts, to acquire concepts that are easy for people to understand automatically. In concept learning, the hidden layer retains verbalizable features relevant to the output, which is crucial when adapting to real-world environments where explanations are required. However, it is known that the interpretabili… ▽ More

    Submitted 26 June, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: 10 pages

  30. arXiv:2205.15598  [pdf

    cs.LG cs.AI

    Individual health-disease phase diagrams for disease prevention based on machine learning

    Authors: Kazuki Nakamura, Eiichiro Uchino, Noriaki Sato, Ayano Araki, Kei Terayama, Ryosuke Kojima, Koichi Murashita, Ken Itoh, Tatsuya Mikami, Yoshinori Tamada, Yasushi Okuno

    Abstract: Early disease detection and prevention methods based on effective interventions are gaining attention. Machine learning technology has enabled precise disease prediction by capturing individual differences in multivariate data. Progress in precision medicine has revealed that substantial heterogeneity exists in health data at the individual level and that complex health factors are involved in the… ▽ More

    Submitted 7 July, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

  31. arXiv:2204.13243  [pdf, other

    cs.CL

    HybriDialogue: An Information-Seeking Dialogue Dataset Grounded on Tabular and Textual Data

    Authors: Kai Nakamura, Sharon Levy, Yi-Lin Tuan, Wenhu Chen, William Yang Wang

    Abstract: A pressing challenge in current dialogue systems is to successfully converse with users on topics with information distributed across different modalities. Previous work in multiturn dialogue systems has primarily focused on either text or table information. In more realistic scenarios, having a joint understanding of both is critical as knowledge is typically distributed over both unstructured an… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: Findings of ACL 2022

  32. arXiv:2202.08303  [pdf, other

    physics.med-ph cs.AI cs.CV

    OpenKBP-Opt: An international and reproducible evaluation of 76 knowledge-based planning pipelines

    Authors: Aaron Babier, Rafid Mahmood, Binghao Zhang, Victor G. L. Alves, Ana Maria Barragán-Montero, Joel Beaudry, Carlos E. Cardenas, Yankui Chang, Zijie Chen, Jaehee Chun, Kelly Diaz, Harold David Eraso, Erik Faustmann, Sibaji Gaj, Skylar Gay, Mary Gronberg, Bingqi Guo, Junjun He, Gerd Heilemann, Sanchit Hira, Yuliang Huang, Fuxin Ji, Dashan Jiang, Jean Carlo Jimenez Giraldo, Hoyeon Lee , et al. (34 additional authors not shown)

    Abstract: We establish an open framework for developing plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models that were developed by different research groups during the OpenKBP Grand Challenge. The dose predictions were input to four optimization mode… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: 19 pages, 7 tables, 6 figures

  33. arXiv:2202.01459  [pdf, other

    cs.CV cs.AI cs.LG

    Concept Bottleneck Model with Additional Unsupervised Concepts

    Authors: Yoshihide Sawada, Keigo Nakamura

    Abstract: With the increasing demands for accountability, interpretability is becoming an essential capability for real-world AI applications. However, most methods utilize post-hoc approaches rather than training the interpretable model. In this article, we propose a novel interpretable model based on the concept bottleneck model (CBM). CBM uses concept labels to train an intermediate layer as the addition… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: 13 pages, 6 figures

  34. arXiv:2111.10210  [pdf, other

    stat.CO cs.LG stat.AP stat.ML

    The Application of Zig-Zag Sampler in Sequential Markov Chain Monte Carlo

    Authors: Yu Han, Kazuyuki Nakamura

    Abstract: Particle filtering methods are widely applied in sequential state estimation within nonlinear non-Gaussian state space model. However, the traditional particle filtering methods suffer the weight degeneracy in the high-dimensional state space model. Currently, there are many methods to improve the performance of particle filtering in high-dimensional state space model. Among these, the more advanc… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

  35. arXiv:2110.05184  [pdf, other

    cs.DM

    Solving Rep-tile by Computers: Performance of Solvers and Analyses of Solutions

    Authors: Mutsunori Banbara, Kenji Hashimoto, Takashi Horiyama, Shin-ichi Minato, Kakeru Nakamura, Masaaki Nishino, Masahiko Sakai, Ryuhei Uehara, Yushi Uno, Norihito Yasuda

    Abstract: A rep-tile is a polygon that can be dissected into smaller copies (of the same size) of the original polygon. A polyomino is a polygon that is formed by joining one or more unit squares edge to edge. These two notions were first introduced and investigated by Solomon W. Golomb in the 1950s and popularized by Martin Gardner in the 1960s. Since then, dozens of studies have been made in communities o… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: 14 pages, 12 figures

  36. arXiv:2110.01773  [pdf, other

    cs.GT cs.DS cs.LG math.OC

    Differentiable Equilibrium Computation with Decision Diagrams for Stackelberg Models of Combinatorial Congestion Games

    Authors: Shinsaku Sakaue, Kengo Nakamura

    Abstract: We address Stackelberg models of combinatorial congestion games (CCGs); we aim to optimize the parameters of CCGs so that the selfish behavior of non-atomic players attains desirable equilibria. This model is essential for designing such social infrastructures as traffic and communication networks. Nevertheless, computational approaches to the model have not been thoroughly studied due to two diff… ▽ More

    Submitted 17 October, 2021; v1 submitted 4 October, 2021; originally announced October 2021.

  37. arXiv:2110.00843  [pdf, other

    cs.RO cs.LG math.OC

    SHARP: Shielding-Aware Robust Planning for Safe and Efficient Human-Robot Interaction

    Authors: Haimin Hu, Kensuke Nakamura, Jaime F. Fisac

    Abstract: Jointly achieving safety and efficiency in human-robot interaction (HRI) settings is a challenging problem, as the robot's planning objectives may be at odds with the human's own intent and expectations. Recent approaches ensure safe robot operation in uncertain environments through a supervisory control scheme, sometimes called "shielding", which overrides the robot's nominal plan with a safety f… ▽ More

    Submitted 10 March, 2022; v1 submitted 2 October, 2021; originally announced October 2021.

  38. Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention

    Authors: Katsuyuki Nakamura, Hiroki Ohashi, Mitsuhiro Okada

    Abstract: Automatically describing video, or video captioning, has been widely studied in the multimedia field. This paper proposes a new task of sensor-augmented egocentric-video captioning, a newly constructed dataset for it called MMAC Captions, and a method for the newly proposed task that effectively utilizes multi-modal data of video and motion sensors, or inertial measurement units (IMUs). While conv… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

    Comments: Accepted to ACM Multimedia (ACMMM) 2021

  39. arXiv:2105.00220  [pdf, other

    cs.LG cs.CV

    Generative Adversarial Networks via a Composite Annealing of Noise and Diffusion

    Authors: Kensuke Nakamura, Simon Korman, Byung-Woo Hong

    Abstract: Generative adversarial network (GAN) is a framework for generating fake data using a set of real examples. However, GAN is unstable in the training stage. In order to stabilize GANs, the noise injection has been used to enlarge the overlap of the real and fake distributions at the cost of increasing variance. The diffusion (or smoothing) may reduce the intrinsic underlying dimensionality of data b… ▽ More

    Submitted 31 July, 2022; v1 submitted 1 May, 2021; originally announced May 2021.

  40. arXiv:2104.06600  [pdf, other

    cs.LG cs.AI cs.RO

    GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback

    Authors: Jie Huang, Rongshun Juan, Randy Gomez, Keisuke Nakamura, Qixin Sha, Bo He, Guangliang Li

    Abstract: Deep reinforcement learning (DRL) has achieved great successes in many simulated tasks. The sample inefficiency problem makes applying traditional DRL methods to real-world robots a great challenge. Generative Adversarial Imitation Learning (GAIL) -- a general model-free imitation learning method, allows robots to directly learn policies from expert trajectories in large environments. However, GAI… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

  41. arXiv:2012.11073  [pdf, ps, other

    cs.LG

    Regularization in network optimization via trimmed stochastic gradient descent with noisy label

    Authors: Kensuke Nakamura, Bong-Soo Sohn, Kyoung-Jae Won, Byung-Woo Hong

    Abstract: Regularization is essential for avoiding over-fitting to training data in network optimization, leading to better generalization of the trained networks. The label noise provides a strong implicit regularization by replacing the target ground truth labels of training examples by uniform random labels. However, it can cause undesirable misleading gradients due to the large loss associated with inco… ▽ More

    Submitted 2 May, 2022; v1 submitted 20 December, 2020; originally announced December 2020.

  42. arXiv:2010.16087  [pdf

    cs.LG cs.AI stat.ML

    Health improvement framework for planning actionable treatment process using surrogate Bayesian model

    Authors: Kazuki Nakamura, Ryosuke Kojima, Eiichiro Uchino, Koichi Murashita, Ken Itoh, Shigeyuki Nakaji, Yasushi Okuno

    Abstract: Clinical decision making regarding treatments based on personal characteristics leads to effective health improvements. Machine learning (ML) has been the primary concern of diagnosis support according to comprehensive patient information. However, the remaining prominent issue is the development of objective treatment processes in clinical situations. This study proposes a novel framework to plan… ▽ More

    Submitted 13 November, 2020; v1 submitted 30 October, 2020; originally announced October 2020.

  43. arXiv:2010.08729  [pdf, other

    stat.ML cs.LG stat.CO

    Ensemble Kalman Variational Objectives: Nonlinear Latent Trajectory Inference with A Hybrid of Variational Inference and Ensemble Kalman Filter

    Authors: Tsuyoshi Ishizone, Tomoyuki Higuchi, Kazuyuki Nakamura

    Abstract: Variational inference (VI) combined with Bayesian nonlinear filtering produces state-of-the-art results for latent time-series modeling. A body of recent work has focused on sequential Monte Carlo (SMC) and its variants, e.g., forward filtering backward simulation (FFBSi). Although these studies have succeeded, serious problems remain in particle degeneracy and biased gradient estimators. In this… ▽ More

    Submitted 9 November, 2021; v1 submitted 17 October, 2020; originally announced October 2020.

  44. arXiv:2010.01824  [pdf, other

    cs.CV cs.AI

    Class-Wise Difficulty-Balanced Loss for Solving Class-Imbalance

    Authors: Saptarshi Sinha, Hiroki Ohashi, Katsuyuki Nakamura

    Abstract: Class-imbalance is one of the major challenges in real world datasets, where a few classes (called majority classes) constitute much more data samples than the rest (called minority classes). Learning deep neural networks using such datasets leads to performances that are typically biased towards the majority classes. Most of the prior works try to solve class-imbalance by assigning more weights t… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted for ACCV 2020 oral presentation

  45. arXiv:2009.13798  [pdf, other

    eess.IV cs.AI

    Automatic Segmentation, Localization, and Identification of Vertebrae in 3D CT Images Using Cascaded Convolutional Neural Networks

    Authors: Naoto Masuzawa, Yoshiro Kitamura, Keigo Nakamura, Satoshi Iizuka, Edgar Simo-Serra

    Abstract: This paper presents a method for automatic segmentation, localization, and identification of vertebrae in arbitrary 3D CT images. Many previous works do not perform the three tasks simultaneously even though requiring a priori knowledge of which part of the anatomy is visible in the 3D CT images. Our method tackles all these tasks in a single multi-stage framework without any assumptions. In the f… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

  46. arXiv:2006.01233  [pdf, other

    cs.RO

    Hibikino-Musashi@Home 2019 Team Description Paper

    Authors: Yuichiro Tanaka, Yutaro Ishida, Yushi Abe, Tomohiro Ono, Kohei Kabashima, Takuma Sakata, Masashi Fukuyado, Fuyuki Muto, Takumi Yoshii, Kazuki Kanamaru, Daichi Kamimura, Kentaro Nakamura, Yuta Nishimura, Takashi Morie, Hakaru Tamukoh

    Abstract: Our team, Hibikino-Musashi@Home (HMA), was founded in 2010. It is based in the Kitakyushu Science and Research Park, Japan. Since 2010, we have participated in the RoboCup@Home Japan Open competition open platform league annually. We have also participated in the RoboCup 2017 Nagoya as an open platform league and domestic standard platform league teams, and in the RoboCup 2018 Montreal as a domest… ▽ More

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: 9pages, 5 figures, RoboCup 2019. arXiv admin note: substantial text overlap with arXiv:2005.14451

  47. arXiv:2005.14451  [pdf, other

    cs.RO

    Hibikino-Musashi@Home 2020 Team Description Paper

    Authors: Tomohiro Ono, Yuichiro Tanaka, Yutaro Ishida, Yushi Abe, Kazuki Kanamaru, Daichi Kamimura, Kentaro Nakamura, Yuta Nishimura, Shoshi Tokuno, Yuya Mii, Morio Yamauchi, Yuichiro Uemura, Takunori Hashimoto, Yugo Nakamura, Issei Uchino, Daiju Kanaoka, Takeru Hanyu, Kenta Tsukamoto, Takashi Morie, Hakaru Tamukoh

    Abstract: Our team, Hibikino-Musashi@Home (HMA), was founded in 2010. It is based in Japan in the Kitakyushu Science and Research Park. Since 2010, we have annually participated in the RoboCup@Home Japan Open competition in the open platform league (OPL). We participated as an open platform league team in the 2017 Nagoya RoboCup competition and as a domestic standard platform league (DSPL) team in the 2017… ▽ More

    Submitted 29 May, 2020; originally announced May 2020.

    Comments: 10 pages, 7 figures, RoboCup 2020

  48. arXiv:2004.14003  [pdf, other

    eess.IV cs.CV

    The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge: A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset

    Authors: Arjun D. Desai, Francesco Caliva, Claudia Iriondo, Naji Khosravan, Aliasghar Mortazi, Sachin Jambawalikar, Drew Torigian, Jutta Ellermann, Mehmet Akcakaya, Ulas Bagci, Radhika Tibrewala, Io Flament, Matthew O`Brien, Sharmila Majumdar, Mathias Perslev, Akshay Pai, Christian Igel, Erik B. Dam, Sibaji Gaj, Mingrui Yang, Kunio Nakamura, Xiaojuan Li, Cem M. Deniz, Vladimir Juras, Ravinder Regatte , et al. (4 additional authors not shown)

    Abstract: Purpose: To organize a knee MRI segmentation challenge for characterizing the semantic and clinical efficacy of automatic segmentation methods relevant for monitoring osteoarthritis progression. Methods: A dataset partition consisting of 3D knee MRI from 88 subjects at two timepoints with ground-truth articular (femoral, tibial, patellar) cartilage and meniscus segmentations was standardized. Ch… ▽ More

    Submitted 26 May, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: Submitted to Radiology: Artificial Intelligence; Fixed typos

  49. arXiv:2004.06341  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Stochastic batch size for adaptive regularization in deep network optimization

    Authors: Kensuke Nakamura, Stefano Soatto, Byung-Woo Hong

    Abstract: We propose a first-order stochastic optimization algorithm incorporating adaptive regularization applicable to machine learning problems in deep learning framework. The adaptive regularization is imposed by stochastic process in determining batch size for each model parameter at each optimization iteration. The stochastic batch size is determined by the update probability of each parameter followi… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

  50. arXiv:2004.05461  [pdf, other

    cs.CE cs.LG

    Deep learning-based topological optimization for representing a user-specified design area

    Authors: Keigo Nakamura, Yoshiro Suzuki

    Abstract: Presently, topology optimization requires multiple iterations to create an optimized structure for given conditions. Among the conditions for topology optimization,the design area is one of the most important for structural design. In this study, we propose a new deep learning model to generate an optimized structure for a given design domain and other boundary conditions without iteration. For th… ▽ More

    Submitted 19 April, 2020; v1 submitted 11 April, 2020; originally announced April 2020.

    Comments: 12 pages, 16 figures