Skip to main content

Showing 1–50 of 66 results for author: Suzuki, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.03946  [pdf, ps, other

    cs.GT

    Fair and Efficient Allocation of Indivisible Mixed Manna

    Authors: Siddharth Barman, Vishwa Prakash HV, Aditi Sethia, Mashbat Suzuki

    Abstract: We study fair division of indivisible mixed manna (items whose values may be positive, negative, or zero) among agents with additive valuations. Here, we establish that fairness -- in terms of a relaxation of envy-freeness -- and Pareto efficiency can always be achieved together. Specifically, our fairness guarantees are in terms of envy-freeness up to $k$ reallocations (EFR-$k$): An allocation… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: 31 pages

  2. arXiv:2506.22881  [pdf, ps, other

    cs.CV

    How Semantically Informative is an Image?: Measuring the Covariance-Weighted Norm of Contrastive Learning Embeddings

    Authors: Fumiya Uchiyama, Rintaro Yanagi, Shohei Taniguchi, Shota Takashiro, Masahiro Suzuki, Hirokatsu Kataoka, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: Contrastive learning has the capacity to model multimodal probability distributions by embedding and aligning visual representations with semantics from captions. This approach enables the estimation of relational semantic similarity; however, it remains unclear whether it can also represent absolute semantic informativeness. In this work, we introduce a semantic informativeness metric for an imag… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

  3. arXiv:2506.20164  [pdf

    q-bio.NC cs.AI

    Do psychic cells generate consciousness?

    Authors: Mototaka Suzuki, Jaan Aru

    Abstract: Technological advances in the past decades have begun to enable neuroscientists to address fundamental questions about consciousness in an unprecedented way. Here we review remarkable recent progress in our understanding of cellular-level mechanisms of conscious processing in the brain. Of particular interest are the cortical pyramidal neurons -- or "psychic cells" called by Ramón y Cajal more tha… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  4. arXiv:2506.14046  [pdf, ps, other

    cs.CL cs.AI

    Ace-CEFR -- A Dataset for Automated Evaluation of the Linguistic Difficulty of Conversational Texts for LLM Applications

    Authors: David Kogan, Max Schumacher, Sam Nguyen, Masanori Suzuki, Melissa Smith, Chloe Sophia Bellows, Jared Bernstein

    Abstract: There is an unmet need to evaluate the language difficulty of short, conversational passages of text, particularly for training and filtering Large Language Models (LLMs). We introduce Ace-CEFR, a dataset of English conversational text passages expert-annotated with their corresponding level of text difficulty. We experiment with several models on Ace-CEFR, including Transformer-based models and L… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  5. arXiv:2503.06138  [pdf, other

    cs.AI cs.RO q-bio.NC

    System 0/1/2/3: Quad-process theory for multi-timescale embodied collective cognitive systems

    Authors: Tadahiro Taniguchi, Yasushi Hirai, Masahiro Suzuki, Shingo Murata, Takato Horii, Kazutoshi Tanaka

    Abstract: This paper introduces the System 0/1/2/3 framework as an extension of dual-process theory, employing a quad-process model of cognition. Expanding upon System 1 (fast, intuitive thinking) and System 2 (slow, deliberative thinking), we incorporate System 0, which represents pre-cognitive embodied processes, and System 3, which encompasses collective intelligence and symbol emergence. We contextualiz… ▽ More

    Submitted 13 March, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

    Comments: Under review

  6. arXiv:2503.00885  [pdf, ps, other

    cs.GT

    Social Welfare Maximization in Approval-Based Committee Voting under Uncertainty

    Authors: Haris Aziz, Yuhang Guo, Venkateswara Rao Kagita, Baharak Rastegari, Mashbat Suzuki

    Abstract: Approval voting is widely used for making multi-winner voting decisions. The canonical rule (also called Approval Voting) used in the setting aims to maximize social welfare by selecting candidates with the highest number of approvals. We revisit approval-based multi-winner voting in scenarios where the information regarding the voters' preferences is uncertain. We present several algorithmic resu… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

  7. arXiv:2502.17869  [pdf, ps, other

    cs.GT

    Maximum Welfare Allocations under Quantile Valuations

    Authors: Haris Aziz, Shivika Narang, Mashbat Suzuki

    Abstract: We propose a new model for aggregating preferences over a set of indivisible items based on a quantile value. In this model, each agent is endowed with a specific quantile, and the value of a given bundle is defined by the corresponding quantile of the individual values of the items within it. Our model captures the diverse ways in which agents may perceive a bundle, even when they agree on the va… ▽ More

    Submitted 17 April, 2025; v1 submitted 25 February, 2025; originally announced February 2025.

  8. arXiv:2502.13671  [pdf, other

    cs.GT

    On the Subsidy of Envy-Free Orientations in Graphs

    Authors: Bo Li, Ankang Sun, Mashbat Suzuki, Shiji Xing

    Abstract: We study a fair division problem in (multi)graphs where $n$ agents (vertices) are pairwise connected by items (edges), and each agent is only interested in its incident items. We consider how to allocate items to incident agents in an envy-free manner, i.e., envy-free orientations, while minimizing the overall payment, i.e., subsidy. We first prove that computing an envy-free orientation with the… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

  9. arXiv:2502.09006  [pdf, ps, other

    cs.GT

    Whoever Said Money Won't Solve All Your Problems? Weighted Envy-free Allocation with Subsidy

    Authors: Noga Klein Elmalem, Haris Aziz, Rica Gonen, Xin Huang, Kei Kimura, Indrajit Saha, Erel Segal-Halevi, Zhaohong Sun, Mashbat Suzuki, Makoto Yokoo

    Abstract: We explore solutions for fairly allocating indivisible items among agents assigned weights representing their entitlements. Our fairness goal is weighted-envy-freeness (WEF), where each agent deems their allocated portion relative to their entitlement at least as favorable as any others relative to their own. Often, achieving WEF necessitates monetary transfers, which can be modeled as third-party… ▽ More

    Submitted 5 March, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

    Comments: 60 pages, 5 tables. arXiv admin note: substantial text overlap with arXiv:2411.12696

  10. arXiv:2501.19252  [pdf, ps, other

    cs.CV

    Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search

    Authors: Yuta Oshima, Masahiro Suzuki, Yutaka Matsuo, Hiroki Furuta

    Abstract: The remarkable progress in text-to-video diffusion models enables photorealistic generations, although the contents of the generated video often include unnatural movement or deformation, reverse playback, and motionless scenes. Recently, an alignment problem has attracted huge attention, where we steer the output of diffusion models based on some quantity on the goodness of the content. Because t… ▽ More

    Submitted 1 June, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

    Comments: Code: https://github.com/shim0114/T2V-Diffusion-Search

  11. arXiv:2501.00226  [pdf, other

    cs.AI cs.CL

    Generative Emergent Communication: Large Language Model is a Collective World Model

    Authors: Tadahiro Taniguchi, Ryo Ueda, Tomoaki Nakamura, Masahiro Suzuki, Akira Taniguchi

    Abstract: This study proposes a unifying theoretical framework called generative emergent communication (generative EmCom) that bridges emergent communication, world models, and large language models (LLMs) through the lens of collective predictive coding (CPC). The proposed framework formalizes the emergence of language and symbol systems through decentralized Bayesian inference across multiple agents, ext… ▽ More

    Submitted 30 December, 2024; originally announced January 2025.

  12. arXiv:2412.05630  [pdf, other

    cs.CE

    Dislocation-based crystal plasticity simulation on grain-size dependence of mechanical properties in dual-phase steels

    Authors: Misato Suzuki, Mayu Muramatsu, Kazuyuki Shizawa

    Abstract: In this study, the effect of ferrite grain size on the mechanical properties and dislocation behavior of dual-phase (DP) steel is investigated using dislocation-based crystal plasticity finite element analysis. DP steel, composed of a soft ferritic phase and a hard martensitic phase, shows mechanical properties that are significantly influenced by ferrite grain size. The mechanism underlying this… ▽ More

    Submitted 7 December, 2024; originally announced December 2024.

    Comments: 16 pages, 14figures

  13. arXiv:2412.02435  [pdf, ps, other

    cs.GT econ.TH

    Approximately Fair and Population Consistent Budget Division via Simple Payment Schemes

    Authors: Haris Aziz, Patrick Lederer, Xinhang Lu, Mashbat Suzuki, Jeremy Vollen

    Abstract: In approval-based budget division, a budget needs to be distributed to candidates based on the voters' approval ballots over these candidates. In the pursuit of a simple, consistent, and approximately fair rule for this setting, we introduce the maximum payment rule (MP). Under this rule, each voter controls a part of the budget and, in each step, the corresponding voters allocate their entire bud… ▽ More

    Submitted 2 July, 2025; v1 submitted 3 December, 2024; originally announced December 2024.

    Comments: This paper (version 1) has been accepted at EC'25. The current version is in preparation for a revision at a journal, which caused significant changes in the presentation of the results and lead to a new title

  14. arXiv:2411.09937  [pdf, other

    cs.CL q-fin.CP

    Refined and Segmented Price Sentiment Indices from Survey Comments

    Authors: Masahiro Suzuki, Hiroki Sakaji

    Abstract: We aim to enhance a price sentiment index and to more precisely understand price trends from the perspective of not only consumers but also businesses. We extract comments related to prices from the Economy Watchers Survey conducted by the Cabinet Office of Japan and classify price trends using a large language model (LLM). We classify whether the survey sample reflects the perspective of consumer… ▽ More

    Submitted 26 November, 2024; v1 submitted 14 November, 2024; originally announced November 2024.

    Comments: Accepted to IEEE BigData 2024. 9 pages, 11 tables, 1 figure

  15. arXiv:2411.02853  [pdf, other

    cs.LG stat.ML

    ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate

    Authors: Shohei Taniguchi, Keno Harada, Gouki Minegishi, Yuta Oshima, Seong Cheol Jeong, Go Nagahara, Tomoshi Iiyama, Masahiro Suzuki, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: Adam is one of the most popular optimization algorithms in deep learning. However, it is known that Adam does not converge in theory unless choosing a hyperparameter, i.e., $β_2$, in a problem-dependent manner. There have been many attempts to fix the non-convergence (e.g., AMSGrad), but they require an impractical assumption that the gradient noise is uniformly bounded. In this paper, we propose… ▽ More

    Submitted 21 November, 2024; v1 submitted 5 November, 2024; originally announced November 2024.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS 2024)

  16. arXiv:2410.20822  [pdf, other

    cs.CE

    Conditional diffusion model for inverse prediction of process parameters and dendritic microstructures from mechanical properties

    Authors: Arisa Ikeda, Ryo Higuchi, Tomohiro Yokozeki, Katsuhiro Endo, Yuta Kojima, Misato Suzuki, Mayu Muramatsu

    Abstract: In this study, we develop a conditional diffusion model that proposes the optimal process parameters and predicts the microstructure for the desired mechanical properties. In materials development, it is costly to try many samples with different parameters in experiments and numerical simulations. The use of data-driven inverse design method can reduce the cost of materials development. This study… ▽ More

    Submitted 14 March, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

    Comments: 22pages, 22figures

  17. arXiv:2410.15728  [pdf, other

    cs.CV cs.LG

    Object-Centric Temporal Consistency via Conditional Autoregressive Inductive Biases

    Authors: Cristian Meo, Akihiro Nakano, Mircea Lică, Aniket Didolkar, Masahiro Suzuki, Anirudh Goyal, Mengmi Zhang, Justin Dauwels, Yutaka Matsuo, Yoshua Bengio

    Abstract: Unsupervised object-centric learning from videos is a promising approach towards learning compositional representations that can be applied to various downstream tasks, such as prediction and reasoning. Recently, it was shown that pretrained Vision Transformers (ViTs) can be useful to learn object-centric representations on real-world video datasets. However, while these approaches succeed at extr… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  18. arXiv:2410.11403  [pdf, other

    cs.LG cs.AI

    Enhancing Unimodal Latent Representations in Multimodal VAEs through Iterative Amortized Inference

    Authors: Yuta Oshima, Masahiro Suzuki, Yutaka Matsuo

    Abstract: Multimodal variational autoencoders (VAEs) aim to capture shared latent representations by integrating information from different data modalities. A significant challenge is accurately inferring representations from any subset of modalities without training an impractical number (2^M) of inference networks for all possible modality combinations. Mixture-based models simplify this by requiring only… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 22 pages, 12 figures

  19. arXiv:2408.12326  [pdf, other

    cs.CL cs.AI cs.CE cs.CY

    Interactive DualChecker for Mitigating Hallucinations in Distilling Large Language Models

    Authors: Meiyun Wang, Masahiro Suzuki, Hiroki Sakaji, Kiyoshi Izumi

    Abstract: Large Language Models (LLMs) have demonstrated exceptional capabilities across various machine learning (ML) tasks. Given the high costs of creating annotated datasets for supervised learning, LLMs offer a valuable alternative by enabling effective few-shot in-context learning. However, these models can produce hallucinations, particularly in domains with incomplete knowledge. Additionally, curren… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  20. arXiv:2408.08711  [pdf, other

    cs.GT

    Weighted Envy-free Allocation with Subsidy

    Authors: Haris Aziz, Xin Huang, Kei Kimura, Indrajit Saha, Zhaohong Sun, Mashbat Suzuki, Makoto Yokoo

    Abstract: We consider the problem of fair allocation of indivisible items with subsidies when agents have weighted entitlements. After highlighting several important differences from the unweighted case, we present several results concerning weighted envy-freeability including general characterizations, algorithms for achieving and testing weighted envy-freeability, lower and upper bounds of the amount of s… ▽ More

    Submitted 17 October, 2024; v1 submitted 16 August, 2024; originally announced August 2024.

    Comments: 26 pages, 1 Table, 1 Figure

  21. arXiv:2407.19391  [pdf, ps, other

    cs.GT

    Approval-Based Committee Voting under Uncertainty

    Authors: Hariz Aziz, Venkateswara Rao Kagita, Baharak Rastegari, Mashbat Suzuki

    Abstract: We study approval-based committee voting in which a target number of candidates are selected based on voters' approval preferences over candidates. In contrast to most of the work, we consider the setting where voters express uncertain approval preferences and explore four different types of uncertain approval preference models. For each model, we study the problems such as computing a committee w… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  22. arXiv:2407.14727  [pdf, ps, other

    cs.CL cs.CE

    Economy Watchers Survey Provides Datasets and Tasks for Japanese Financial Domain

    Authors: Masahiro Suzuki, Hiroki Sakaji

    Abstract: Natural language processing (NLP) tasks in English and general domains are widely available and are often used to evaluate pre-trained language models. In contrast, fewer tasks are available for languages other than English and in the financial domain. Particularly, tasks in the Japanese and financial domains are limited. We develop two large datasets using data published by a Japanese central gov… ▽ More

    Submitted 1 February, 2025; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: Accepted to the ACM Web Conference 2025. 4 pages

  23. arXiv:2407.13300  [pdf, other

    cs.CL eess.AS

    Robust ASR Error Correction with Conservative Data Filtering

    Authors: Takuma Udagawa, Masayuki Suzuki, Masayasu Muraoka, Gakuto Kurata

    Abstract: Error correction (EC) based on large language models is an emerging technology to enhance the performance of automatic speech recognition (ASR) systems. Generally, training data for EC are collected by automatically pairing a large set of ASR hypotheses (as sources) and their gold references (as targets). However, the quality of such pairs is not guaranteed, and we observed various types of noise… ▽ More

    Submitted 16 October, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted to EMNLP 2024 Industry Track

  24. arXiv:2407.13171  [pdf, other

    cs.GT

    Maximin Fair Allocation of Indivisible Items under Cost Utilities

    Authors: Sirin Botan, Angus Ritossa, Mashbat Suzuki, Toby Walsh

    Abstract: We study the problem of fairly allocating indivisible goods among a set of agents. Our focus is on the existence of allocations that give each agent their maximin fair share--the value they are guaranteed if they divide the goods into as many bundles as there are agents, and receive their lowest valued bundle. An MMS allocation is one where every agent receives at least their maximin fair share. W… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Appeared in SAGT 2023

  25. arXiv:2407.12461  [pdf, other

    cs.GT

    Compatibility of Fairness and Nash Welfare under Subadditive Valuations

    Authors: Siddharth Barman, Mashbat Suzuki

    Abstract: We establish a compatibility between fairness and efficiency, captured via Nash Social Welfare (NSW), under the broad class of subadditive valuations. We prove that, for subadditive valuations, there always exists a partial allocation that is envy-free up to the removal of any good (EFx) and has NSW at least half of the optimal; here, optimality is considered across all allocations, fair or otherw… ▽ More

    Submitted 4 March, 2025; v1 submitted 17 July, 2024; originally announced July 2024.

    Comments: 23 pages

  26. arXiv:2407.05240  [pdf, other

    cs.GT

    Neighborhood Stability in Assignments on Graphs

    Authors: Haris Aziz, Grzegorz Lisowski, Mashbat Suzuki, Jeremy Vollen

    Abstract: We study the problem of assigning agents to the vertices of a graph such that no pair of neighbors can benefit from swapping assignments -- a property we term neighborhood stability. We further assume that agents' utilities are based solely on their preferences over the assignees of adjacent vertices and that those preferences are binary. Having shown that even this very restricted setting does no… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  27. arXiv:2406.14907  [pdf, other

    cs.GT econ.TH

    Maximum Flow is Fair: A Network Flow Approach to Committee Voting

    Authors: Mashbat Suzuki, Jeremy Vollen

    Abstract: In the committee voting setting, a subset of $k$ alternatives is selected based on the preferences of voters. In this paper, our goal is to efficiently compute $\textit{ex-ante}$ fair probability distributions over committees. We introduce a new axiom called $\textit{group resource proportionality}$, which strengthens other fairness notions in the literature. We characterize our fairness axiom by… ▽ More

    Submitted 27 December, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: Previous version appeared in EC 2024. This version features significant additional content; notably, treatment of excludable strategyproofness and modified motivation surrounding fractional core. The latter change was made as we realized that the definition of fractional core was misstated in our previous version. In the current manuscript, our old definition has become strict fractional core

  28. arXiv:2406.00765  [pdf

    cs.AI cs.CL

    The Embodied World Model Based on LLM with Visual Information and Prediction-Oriented Prompts

    Authors: Wakana Haijima, Kou Nakakubo, Masahiro Suzuki, Yutaka Matsuo

    Abstract: In recent years, as machine learning, particularly for vision and language understanding, has been improved, research in embedded AI has also evolved. VOYAGER is a well-known LLM-based embodied AI that enables autonomous exploration in the Minecraft world, but it has issues such as underutilization of visual data and insufficient functionality as a world model. In this research, the possibility of… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  29. Investigation on optimal microstructure of dual-phase steel with high strength and ductility by machine learning

    Authors: Misato Suzuki, Kazuyuki Shizawa, Mayu Muramatsu

    Abstract: In this study, we developed an inverse analysis framework that proposes a microstructure for dual-phase (DP) steel that exhibits high strength and ductility. The inverse analysis method proposed in this study involves repeated random searches on a model that combines a generative adversarial network (GAN), which generates microstructures, and a convolutional neural network (CNN), which predicts th… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 27 pages, 23 figures

    Journal ref: Mater. Today Commun., Volume 41, 110557, 2024

  30. arXiv:2404.09260  [pdf, other

    cs.CL cs.CE

    JaFIn: Japanese Financial Instruction Dataset

    Authors: Kota Tanabe, Masahiro Suzuki, Hiroki Sakaji, Itsuki Noda

    Abstract: We construct an instruction dataset for the large language model (LLM) in the Japanese finance domain. Domain adaptation of language models, including LLMs, is receiving more attention as language models become more popular. This study demonstrates the effectiveness of domain adaptation through instruction tuning. To achieve this, we propose an instruction tuning data in Japanese called JaFIn, the… ▽ More

    Submitted 19 July, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: 10 pages, 1 figure. The paper is a camera-ready version for the 2024 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics (CIFEr)

  31. arXiv:2404.05198  [pdf, ps, other

    cs.GT

    Fair Lotteries for Participatory Budgeting

    Authors: Haris Aziz, Xinhang Lu, Mashbat Suzuki, Jeremy Vollen, Toby Walsh

    Abstract: In pursuit of participatory budgeting (PB) outcomes with broader fairness guarantees, we initiate the study of lotteries over discrete PB outcomes. As the projects have heterogeneous costs, the amount spent may not be equal ex ante and ex post. To address this, we develop a technique to bound the amount by which the ex-post spend differs from the ex-ante spend -- the property is termed budget bala… ▽ More

    Submitted 11 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Appears in the 38th AAAI Conference on Artificial Intelligence (AAAI), 2024

  32. arXiv:2403.07711  [pdf, other

    cs.CV cs.AI

    SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces

    Authors: Yuta Oshima, Shohei Taniguchi, Masahiro Suzuki, Yutaka Matsuo

    Abstract: Given the remarkable achievements in image generation through diffusion models, the research community has shown increasing interest in extending these models to video generation. Recent diffusion models for video generation have predominantly utilized attention layers to extract temporal features. However, attention layers are limited by their computational costs, which increase quadratically wit… ▽ More

    Submitted 3 September, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted as a workshop paper at ICLR 2024

  33. arXiv:2402.14484  [pdf, other

    cs.CL

    Is ChatGPT the Future of Causal Text Mining? A Comprehensive Evaluation and Analysis

    Authors: Takehiro Takayanagi, Masahiro Suzuki, Ryotaro Kobayashi, Hiroki Sakaji, Kiyoshi Izumi

    Abstract: Causality is fundamental in human cognition and has drawn attention in diverse research fields. With growing volumes of textual data, discerning causalities within text data is crucial, and causal text mining plays a pivotal role in extracting meaningful patterns. This study conducts comprehensive evaluations of ChatGPT's causal text mining capabilities. Firstly, we introduce a benchmark that exte… ▽ More

    Submitted 23 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  34. arXiv:2312.11286  [pdf, ps, other

    cs.GT

    Envy-free House Allocation under Uncertain Preferences

    Authors: Haris Aziz, Isaiah Iliffe, Bo Li, Angus Ritossa, Ankang Sun, Mashbat Suzuki

    Abstract: We study the envy-free house allocation problem when agents have uncertain preferences over items and consider several well-studied preference uncertainty models. The central problem that we focus on is computing an allocation that has the highest probability of being envy-free. We show that each model leads to a distinct set of algorithmic and complexity results, including detailed results on (in… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: To appear in the proceeding of AAAI2024

  35. arXiv:2310.12900  [pdf

    cs.LG cs.AI

    Personalized human mobility prediction for HuMob challenge

    Authors: Masahiro Suzuki, Shomu Furuta, Yusuke Fukazawa

    Abstract: We explain the methodology used to create the data submitted to HuMob Challenge, a data analysis competition for human mobility prediction. We adopted a personalized model to predict the individual's movement trajectory from their data, instead of predicting from the overall movement, based on the hypothesis that human movement is unique to each person. We devised the features such as the date and… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  36. arXiv:2310.10083  [pdf, other

    cs.CL

    JMedLoRA:Medical Domain Adaptation on Japanese Large Language Models using Instruction-tuning

    Authors: Issey Sukeda, Masahiro Suzuki, Hiroki Sakaji, Satoshi Kodera

    Abstract: In the ongoing wave of impact driven by large language models (LLMs) like ChatGPT, the adaptation of LLMs to medical domain has emerged as a crucial research frontier. Since mainstream LLMs tend to be designed for general-purpose applications, constructing a medical LLM through domain adaptation is a huge challenge. While instruction-tuning is used to fine-tune some LLMs, its precise roles in doma… ▽ More

    Submitted 30 November, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 8 pages, 1 figures

  37. arXiv:2309.04031  [pdf, other

    cs.CL cs.SD eess.AS

    Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems

    Authors: Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Masayasu Muraoka, George Saon

    Abstract: Transferring the knowledge of large language models (LLMs) is a promising technique to incorporate linguistic knowledge into end-to-end automatic speech recognition (ASR) systems. However, existing works only transfer a single representation of LLM (e.g. the last layer of pretrained BERT), while the representation of a text is inherently non-unique and can be obtained variously from different laye… ▽ More

    Submitted 25 December, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP 2024

  38. arXiv:2309.03412  [pdf, other

    cs.CL

    From Base to Conversational: Japanese Instruction Dataset and Tuning Large Language Models

    Authors: Masahiro Suzuki, Masanori Hirano, Hiroki Sakaji

    Abstract: Instruction tuning is essential for large language models (LLMs) to become interactive. While many instruction tuning datasets exist in English, there is a noticeable lack in other languages. Also, their effectiveness has not been well verified in non-English languages. We construct a Japanese instruction dataset by expanding and filtering existing datasets and apply the dataset to a Japanese pre-… ▽ More

    Submitted 5 November, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: 10 pages, 1 figure, 2 tables. The paper is a camera-ready version of IEEE BigData 2023

  39. Mixed Fair Division: A Survey

    Authors: Shengxin Liu, Xinhang Lu, Mashbat Suzuki, Toby Walsh

    Abstract: Fair division considers the allocation of scarce resources among agents in such a way that every agent gets a fair share. It is a fundamental problem in society and has received significant attention and rapid developments from the game theory and artificial intelligence communities in recent years. The majority of the fair division literature can be divided along at least two orthogonal direction… ▽ More

    Submitted 12 August, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Appears in the 38th AAAI Conference on Artificial Intelligence (AAAI), Senior Member Presentation Track, 2024

    Journal ref: Journal of Artificial Intelligence Research (JAIR), 80:1373-1406, 2024

  40. arXiv:2305.19684  [pdf, other

    cs.LG cs.AI stat.ML

    End-to-end Training of Deep Boltzmann Machines by Unbiased Contrastive Divergence with Local Mode Initialization

    Authors: Shohei Taniguchi, Masahiro Suzuki, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: We address the problem of biased gradient estimation in deep Boltzmann machines (DBMs). The existing method to obtain an unbiased estimator uses a maximal coupling based on a Gibbs sampler, but when the state is high-dimensional, it takes a long time to converge. In this study, we propose to use a coupling based on the Metropolis-Hastings (MH) and to initialize the state around a local mode of the… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted at ICML 2023

  41. arXiv:2305.12720  [pdf, ps, other

    cs.CL cs.AI

    llm-japanese-dataset v0: Construction of Japanese Chat Dataset for Large Language Models and its Methodology

    Authors: Masanori Hirano, Masahiro Suzuki, Hiroki Sakaji

    Abstract: This study constructed a Japanese chat dataset for tuning large language models (LLMs), which consist of about 8.4 million records. Recently, LLMs have been developed and gaining popularity. However, high-performing LLMs are usually mainly for English. There are two ways to support languages other than English by those LLMs: constructing LLMs from scratch or tuning existing models. However, in bot… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 12 pages

  42. arXiv:2303.03642  [pdf, ps, other

    cs.GT econ.TH

    Best-of-Both-Worlds Fairness in Committee Voting

    Authors: Haris Aziz, Xinhang Lu, Mashbat Suzuki, Jeremy Vollen, Toby Walsh

    Abstract: The best-of-both-worlds paradigm advocates an approach that achieves desirable properties both ex-ante and ex-post. We launch a best-of-both-worlds fairness perspective for the important social choice setting of approval-based committee voting. To this end, we initiate work on ex-ante proportional representation properties in this domain and formalize a hierarchy of notions including Individual Fa… ▽ More

    Submitted 25 December, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Appears in the 19th Conference on Web and Internet Economics (WINE), 2023

  43. arXiv:2301.05832  [pdf, other

    cs.RO cs.AI cs.LG

    World Models and Predictive Coding for Cognitive and Developmental Robotics: Frontiers and Challenges

    Authors: Tadahiro Taniguchi, Shingo Murata, Masahiro Suzuki, Dimitri Ognibene, Pablo Lanillos, Emre Ugur, Lorenzo Jamone, Tomoaki Nakamura, Alejandra Ciria, Bruno Lara, Giovanni Pezzulo

    Abstract: Creating autonomous robots that can actively explore the environment, acquire knowledge and learn skills continuously is the ultimate achievement envisioned in cognitive and developmental robotics. Their learning processes should be based on interactions with their physical and social world in the manner of human learning and cognitive development. Based on this context, in this paper, we focus on… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 28 pages, 3 figures

  44. arXiv:2211.00879  [pdf, ps, other

    cs.GT

    Fair Allocation of Two Types of Chores

    Authors: Haris Aziz, Jeremy Lindsay, Angus Ritossa, Mashbat Suzuki

    Abstract: We consider the problem of fair allocation of indivisible chores under additive valuations. We assume that the chores are divided into two types and under this scenario, we present several results. Our first result is a new characterization of Pareto optimal allocations in our setting, and a polynomial-time algorithm to compute an envy-free up to one item (EF1) and Pareto optimal allocation. We th… ▽ More

    Submitted 24 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

  45. arXiv:2210.08703  [pdf, ps, other

    cs.HC

    Spoken Dialogue System Based on Attribute Vector for Travel Agent Robot

    Authors: Motoyuki Suzuki, Shintaro Sodeya, Taichi Nakamura

    Abstract: In this study, we develop a dialogue system for a dialogue robot competition. In the system, the characteristics of sightseeing spots are expressed as "attribute vectors" in advance, and the user is questioned on the different attributes of the two candidate spots. Consequently, the system can make recommendations based on user intentions. A dialogue experiment is conducted during a preliminary ro… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: This paper is part of the proceedings of the Dialogue Robot Competition 2022

  46. A survey of multimodal deep generative models

    Authors: Masahiro Suzuki, Yutaka Matsuo

    Abstract: Multimodal learning is a framework for building models that make predictions based on different types of modalities. Important challenges in multimodal learning are the inference of shared representations from arbitrary modalities and cross-modal generation via these representations; however, achieving this requires taking the heterogeneous nature of multimodal data into account. In recent years,… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: Published in Advanced Robotics

    Journal ref: Advanced Robotics, 36:5-6, 261-278, 2022

  47. arXiv:2206.05966  [pdf, other

    cs.GT cs.CC cs.MA econ.TH

    Coordinating Monetary Contributions in Participatory Budgeting

    Authors: Haris Aziz, Sujit Gujar, Manisha Padala, Mashbat Suzuki, Jeremy Vollen

    Abstract: We formalize a framework for coordinating funding and selecting projects, the costs of which are shared among agents with quasi-linear utility functions and individual budgets. Our model contains the classical discrete participatory budgeting model as a special case, while capturing other useful scenarios. We propose several important axioms and objectives and study how well they can be simultaneo… ▽ More

    Submitted 22 February, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: In this version, we include results regarding single minded valuations. We have also corrected a bug in the proof of Lemma 1

  48. arXiv:2205.14798  [pdf, other

    cs.GT cs.AI cs.MA econ.TH

    Random Rank: The One and Only Strategyproof and Proportionally Fair Randomized Facility Location Mechanism

    Authors: Haris Aziz, Alexander Lam, Mashbat Suzuki, Toby Walsh

    Abstract: Proportionality is an attractive fairness concept that has been applied to a range of problems including the facility location problem, a classic problem in social choice. In our work, we propose a concept called Strong Proportionality, which ensures that when there are two groups of agents at different locations, both groups incur the same total cost. We show that although Strong Proportionality… ▽ More

    Submitted 14 June, 2022; v1 submitted 29 May, 2022; originally announced May 2022.

  49. arXiv:2204.00212  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems

    Authors: Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Nobuyasu Itoh, George Saon

    Abstract: Large-scale language models (LLMs) such as GPT-2, BERT and RoBERTa have been successfully applied to ASR N-best rescoring. However, whether or how they can benefit competitive, near state-of-the-art ASR systems remains unexplored. In this study, we incorporate LLM rescoring into one of the most competitive ASR baselines: the Conformer-Transducer model. We demonstrate that consistent improvement is… ▽ More

    Submitted 18 August, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: Accepted to Interspeech 2022

  50. arXiv:2203.15176  [pdf, other

    cs.CL cs.SD eess.AS

    Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing

    Authors: Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata

    Abstract: We introduce two techniques, length perturbation and n-best based label smoothing, to improve generalization of deep neural network (DNN) acoustic models for automatic speech recognition (ASR). Length perturbation is a data augmentation algorithm that randomly drops and inserts frames of an utterance to alter the length of the speech feature sequence. N-best based label smoothing randomly injects… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Submitted to Interspeech 2022