
Showing 1–18 of 18 results for author: Shibata, T

Searching in archive cs.
  1. arXiv:2507.06654

    cs.CV cs.AI cs.IR

    MS-DPPs: Multi-Source Determinantal Point Processes for Contextual Diversity Refinement of Composite Attributes in Text to Image Retrieval

    Authors: Naoya Sogi, Takashi Shibata, Makoto Terao, Masanori Suganuma, Takayuki Okatani

    Abstract: Result diversification (RD) is a crucial technique in Text-to-Image Retrieval for enhancing the efficiency of a practical application. Conventional methods focus solely on increasing the diversity metric of image appearances. However, the diversity metric and its desired value vary depending on the application, which limits the applications of RD. This paper proposes a novel task called CDR-CA (Co…

    Submitted 9 July, 2025; originally announced July 2025.

    Comments: IJCAI 2025. Code: https://github.com/NEC-N-SOGI/msdpp

  2. arXiv:2505.08498

    cs.CL cs.AI

    LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models

    Authors: Takumi Shibata, Yuichi Miyamura

    Abstract: Recent advances in large language models (LLMs) have enabled zero-shot automated essay scoring (AES), providing a promising way to reduce the cost and effort of essay scoring in comparison with manual grading. However, most existing zero-shot approaches rely on LLMs to directly generate absolute scores, which often diverge from human evaluations owing to model biases and inconsistent scoring. To a…

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 14 pages, 4 figures

  3. arXiv:2412.21205

    cs.CV cs.AI cs.LG

    Action-Agnostic Point-Level Supervision for Temporal Action Detection

    Authors: Shuhei M. Yoshida, Takashi Shibata, Makoto Terao, Takayuki Okatani, Masashi Sugiyama

    Abstract: We propose action-agnostic point-level (AAPL) supervision for temporal action detection to achieve accurate action instance detection with a lightly annotated dataset. In the proposed scheme, a small portion of video frames is sampled in an unsupervised manner and presented to human annotators, who then label the frames with action categories. Unlike point-level supervision, which requires annotat…

    Submitted 30 December, 2024; originally announced December 2024.

    Comments: AAAI-25. Technical appendices included. 15 pages, 3 figures, 11 tables

  4. arXiv:2411.00409

    cs.LG

    Black-Box Forgetting

    Authors: Yusuke Kuwana, Yuta Goto, Takashi Shibata, Go Irie

    Abstract: Large-scale pre-trained models (PTMs) provide remarkable zero-shot classification capability covering a wide variety of object classes. However, practical applications do not always require the classification of all kinds of objects, and leaving the model capable of recognizing unnecessary classes not only degrades overall accuracy but also leads to operational disadvantages. To mitigate this issu…

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  5. arXiv:2407.12346

    cs.CV cs.IR cs.LG

    Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval

    Authors: Naoya Sogi, Takashi Shibata, Makoto Terao

    Abstract: The pre-trained vision and language (V&L) models have substantially improved the performance of cross-modal image-text retrieval. In general, however, V&L models have limited retrieval performance for small objects because of the rough alignment between words and the small objects in the image. In contrast, it is known that human cognition is object-centric, and we pay more attention to importan…

    Submitted 24 September, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

    Comments: ECCV 2024. Code: https://github.com/NEC-N-SOGI/query-perturbation

  6. arXiv:2404.03415

    cs.RO cs.CV

    Future Predictive Success-or-Failure Classification for Long-Horizon Robotic Tasks

    Authors: Naoya Sogi, Hiroyuki Oyama, Takashi Shibata, Makoto Terao

    Abstract: Automating long-horizon tasks with a robotic arm has been a central research topic in robotics. Optimization-based action planning is an efficient approach for creating an action plan to complete a given task. Construction of a reliable planning method requires a design process of conditions, e.g., to avoid collision between objects. The design process, however, has two critical issues: 1) iterati…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: IJCNN 2024

  7. arXiv:2312.02771

    cs.ET physics.app-ph

    Scaling-up Memristor Monte Carlo with magnetic domain-wall physics

    Authors: Thomas Dalgaty, Shogo Yamada, Anca Molnos, Eiji Kawasaki, Thomas Mesquida, François Rummens, Tatsuo Shibata, Yukihiro Urakawa, Yukio Terasaki, Tomoyuki Sasaki, Marc Duranton

    Abstract: By exploiting the intrinsic random nature of nanoscale devices, Memristor Monte Carlo (MMC) is a promising enabler of edge learning systems. However, due to multiple algorithmic and device-level limitations, existing demonstrations have been restricted to very small neural network models and datasets. We discuss these limitations, and describe how they can be overcome, by mapping the stochastic gr…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Presented at the 1st workshop on Machine Learning with New Compute Paradigms (MLNCP) at NeurIPS 2023 (New Orleans, USA)

  8. arXiv:2306.05670

    cs.LG cs.AI cs.CV

    One-Shot Machine Unlearning with Mnemonic Code

    Authors: Tomoya Yamashita, Masanori Yamada, Takashi Shibata

    Abstract: Ethical and privacy issues inherent in artificial intelligence (AI) applications have been a growing concern with the rapid spread of deep learning. Machine unlearning (MU) is the research area that addresses these issues by making a trained AI model forget about undesirable training data. Unfortunately, most existing MU methods incur significant time and computational costs for forgetting. Theref…

    Submitted 25 September, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 24 pages, comments welcome

  9. arXiv:2205.09185

    physics.ins-det cs.LG hep-ex nucl-ex physics.comp-ph

    AI-assisted Optimization of the ECCE Tracking System at the Electron Ion Collider

    Authors: C. Fanelli, Z. Papandreou, K. Suresh, J. K. Adkins, Y. Akiba, A. Albataineh, M. Amaryan, I. C. Arsene, C. Ayerbe Gayoso, J. Bae, X. Bai, M. D. Baker, M. Bashkanov, R. Bellwied, F. Benmokhtar, V. Berdnikov, J. C. Bernauer, F. Bock, W. Boeglin, M. Borysova, E. Brash, P. Brindza, W. J. Briscoe, M. Brooks, S. Bueltmann, et al. (258 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC) is a cutting-edge accelerator facility that will study the nature of the "glue" that binds the building blocks of the visible matter in the universe. The proposed experiment will be realized at Brookhaven National Laboratory in approximately 10 years from now, with detector design and R&D currently ongoing. Notably, EIC is one of the first large-scale facilities to…

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: 16 pages, 18 figures, 2 appendices, 3 tables

  10. arXiv:2107.11196

    cs.CV

    Multi-Modal Pedestrian Detection with Large Misalignment Based on Modal-Wise Regression and Multi-Modal IoU

    Authors: Napat Wanchaitanawong, Masayuki Tanaka, Takashi Shibata, Masatoshi Okutomi

    Abstract: The combined use of multiple modalities enables accurate pedestrian detection under poor lighting conditions by using the high visibility areas from these modalities together. The vital assumption for the combination use is that there is no or only a weak misalignment between the two modalities. In general, however, this assumption often breaks in actual situations. Due to this assumption's breakd…

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: Accepted by MVA2021

  11. arXiv:2107.10524

    cs.CV

    Geometric Data Augmentation Based on Feature Map Ensemble

    Authors: Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi

    Abstract: Deep convolutional networks have become the mainstream in computer vision applications. Although CNNs have been successful in many computer vision tasks, it is not free from drawbacks. The performance of CNN is dramatically degraded by geometric transformation, such as large rotations. In this paper, we propose a novel CNN architecture that can improve the robustness against geometric transformati…

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Accepted to ICIP2021

  12. arXiv:2106.01656

    cs.CV

    Generalized Domain Adaptation

    Authors: Yu Mitsuzumi, Go Irie, Daiki Ikami, Takashi Shibata

    Abstract: Many variants of unsupervised domain adaptation (UDA) problems have been proposed and solved individually. Its side effect is that a method that works for one variant is often ineffective for or not even applicable to another, which has prevented practical applications. In this paper, we give a general representation of UDA problems, named Generalized Domain Adaptation (GDA). GDA covers the major…

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: Accepted by CVPR 2021. Code is available at https://github.com/nttcslab/Generalized-Domain-Adaptation

  13. arXiv:2105.04763

    cs.RO

    A Study on Simultaneous Use of a Robotic Walker and a Pneumatic Walking Assist Device Designed for PD Patients

    Authors: Abdul Ali, Rikuo Kawamoto, Tomohiro Shibata

    Abstract: Parkinson's disease (PD) is a common neurodegenerative disease that affects motor and non-motor symptoms. Postural instability and freezing of gait (FOG) are considered motor symptoms of PD resulting in falling. In this study, we investigated the effect of simultaneous use of a robotic walker and a pneumatic walking assist device (PWAD) for PD patients on gait features. The pneumatic actuated arti…

    Submitted 12 May, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: 5 pages, 8 figures, Submitted to The 18th International Conference on Ubiquitous Robots (UR) 2021

  14. arXiv:2011.09140

    cs.CL

    Diverse and Non-redundant Answer Set Extraction on Community QA based on DPPs

    Authors: Shogo Fujita, Tomohide Shibata, Manabu Okumura

    Abstract: In community-based question answering (CQA) platforms, it takes time for a user to get useful information from among many answers. Although one solution is an answer ranking method, the user still needs to read through the top-ranked answers carefully. This paper proposes a new task of selecting a diverse and non-redundant answer set rather than ranking the answers. Our method is based on determin…

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: COLING2020, 12 pages

  15. arXiv:2006.04864

    cs.HC cs.CY

    Design and Development of an Automated Coimagination Support System

    Authors: John Noel Victorino, Naoto Fukunaga, Tomohiro Shibata

    Abstract: Coimagination method is a novel approach to support interactive communication for activating three (3) cognitive functions: episodic memory, division of attention, and planning. These cognitive functions are known to decline at an early stage of mild cognitive impairment (MCI). In previous studies about the coimagination method, experimenters tested different settings in different care institution…

    Submitted 17 June, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: 6 pages, 9 figures, submitted to The 8th International Conference on Human-Agent Interaction (HAI 2020)

  16. arXiv:1905.02851

    cs.IR cs.CL

    FAQ Retrieval using Query-Question Similarity and BERT-Based Query-Answer Relevance

    Authors: Wataru Sakata, Tomohide Shibata, Ribeka Tanaka, Sadao Kurohashi

    Abstract: Frequently Asked Question (FAQ) retrieval is an important task where the objective is to retrieve an appropriate Question-Answer (QA) pair from a database based on a user's query. We propose a FAQ retrieval system that considers the similarity between a user's query and a question as well as the relevance between the query and an answer. Although a common approach to FAQ retrieval is to construct…

    Submitted 23 May, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: Accepted in SIGIR 2019 (short paper), camera ready, 4 pages

  17. arXiv:1809.09297

    cs.CV

    Gradient-Based Low-Light Image Enhancement

    Authors: Masayuki Tanaka, Takashi Shibata, Masatoshi Okutomi

    Abstract: A low-light image enhancement is a highly demanded image processing technique, especially for consumer digital cameras and cameras on mobile phones. In this paper, a gradient-based low-light image enhancement algorithm is proposed. The key is to enhance the gradients of dark region, because the gradients are more sensitive for human visual system than absolute values. In addition, we involve the i…

    Submitted 24 September, 2018; originally announced September 2018.

  18. Reading Comprehension using Entity-based Memory Network

    Authors: Xun Wang, Katsuhito Sudoh, Masaaki Nagata, Tomohide Shibata, Daisuke Kawahara, Sadao Kurohashi

    Abstract: This paper introduces a novel neural network model for question answering, the entity-based memory network. It enhances neural networks' ability of representing and calculating information over a long period by keeping records of entities contained in text. The core component is a memory pool which comprises entities' states. These entities' states are continuously updated according to the…

    Submitted 1 February, 2017; v1 submitted 12 December, 2016; originally announced December 2016.