Skip to main content

Showing 1–12 of 12 results for author: Kornuta, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2508.13564  [pdf, ps, other

    cs.CV cs.AI cs.LG cs.RO

    The 9th AI City Challenge

    Authors: Zheng Tang, Shuo Wang, David C. Anastasiu, Ming-Ching Chang, Anuj Sharma, Quan Kong, Norimasa Kobori, Munkhjargal Gochoo, Ganzorig Batnasan, Munkh-Erdene Otgonbold, Fady Alnajjar, Jun-Wei Hsieh, Tomasz Kornuta, Xiaolong Li, Yilin Zhao, Han Zhang, Subhashree Radhakrishnan, Arihant Jain, Ratnesh Kumar, Vidya N. Murali, Yuxing Wang, Sameer Satish Pusegaonkar, Yizhou Wang, Sujit Biswas, Xunlei Wu , et al. (3 additional authors not shown)

    Abstract: The ninth AI City Challenge continues to advance real-world applications of computer vision and AI in transportation, industrial automation, and public safety. The 2025 edition featured four tracks and saw a 17% increase in participation, with 245 teams from 15 countries registered on the evaluation server. Public release of challenge datasets led to over 30,000 downloads to date. Track 1 focused… ▽ More

    Submitted 19 August, 2025; originally announced August 2025.

    Comments: Summary of the 9th AI City Challenge Workshop in conjunction with ICCV 2025

  2. arXiv:2212.07942  [pdf, other

    q-fin.CP cs.LG

    Multi-Agent Dynamic Pricing in a Blockchain Protocol Using Gaussian Bandits

    Authors: Alexis Asseman, Tomasz Kornuta, Anirudh Patel, Matt Deible, Sam Green

    Abstract: The Graph Protocol indexes historical blockchain transaction data and makes it available for querying. As the protocol is decentralized, there are many independent Indexers that index and compete with each other for serving queries to the Consumers. One dimension along which Indexers compete is pricing. In this paper, we propose a bandit-based algorithm for maximization of Indexers' revenue via Co… ▽ More

    Submitted 6 January, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

  3. arXiv:2008.12335  [pdf, other

    cs.LG stat.ML

    A Fast and Robust BERT-based Dialogue State Tracker for Schema-Guided Dialogue Dataset

    Authors: Vahid Noroozi, Yang Zhang, Evelina Bakhturina, Tomasz Kornuta

    Abstract: Dialog State Tracking (DST) is one of the most crucial modules for goal-oriented dialogue systems. In this paper, we introduce FastSGT (Fast Schema Guided Tracker), a fast and robust BERT-based model for state tracking in goal-oriented dialogue systems. The proposed model is designed for the Schema-Guided Dialogue (SGD) dataset which contains natural language descriptions for all the entities incl… ▽ More

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: Accepted to the Workshop on Conversational Systems Towards Mainstream Adoption at KDD 2020

  4. arXiv:1911.11938  [pdf, other

    cs.CV cs.AI cs.LG

    Transfer Learning in Visual and Relational Reasoning

    Authors: T. S. Jayram, Vincent Marois, Tomasz Kornuta, Vincent Albouy, Emre Sevgen, Ahmet S. Ozcan

    Abstract: Transfer learning has become the de facto standard in computer vision and natural language processing, especially where labeled data is scarce. Accuracy can be significantly improved by using pre-trained models and subsequent fine-tuning. In visual reasoning tasks, such as image question answering, transfer learning is more complex. In addition to transferring the capability to recognize visual fe… ▽ More

    Submitted 14 February, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: 18 pages; more baseline comparisons; additional clarifications

  5. arXiv:1910.08654  [pdf, other

    cs.LG

    PyTorchPipe: a framework for rapid prototyping of pipelines combining language and vision

    Authors: Tomasz Kornuta

    Abstract: Access to vast amounts of data along with affordable computational power stimulated the reincarnation of neural networks. The progress could not be achieved without adequate software tools, lowering the entry bar for the next generations of researchers and developers. The paper introduces PyTorchPipe (PTP), a framework built on top of PyTorch. Answering the recent needs and trends in machine learn… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: Paper accepted for SysML 2019 workshop at 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

  6. arXiv:1905.12008  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Leveraging Medical Visual Question Answering with Supporting Facts

    Authors: Tomasz Kornuta, Deepta Rajan, Chaitanya Shivade, Alexis Asseman, Ahmet S. Ozcan

    Abstract: In this working notes paper, we describe IBM Research AI (Almaden) team's participation in the ImageCLEF 2019 VQA-Med competition. The challenge consists of four question-answering tasks based on radiology images. The diversity of imaging modalities, organs and disease types combined with a small imbalanced training set made this a highly complex problem. To overcome these difficulties, we impleme… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: Working notes from the ImageCLEF 2019 VQA-Med competition

  7. arXiv:1811.06529  [pdf, other

    cs.CV cs.LG

    On transfer learning using a MAC model variant

    Authors: Vincent Marois, T. S. Jayram, Vincent Albouy, Tomasz Kornuta, Younes Bouhadjar, Ahmet S. Ozcan

    Abstract: We introduce a variant of the MAC model (Hudson and Manning, ICLR 2018) with a simplified set of equations that achieves comparable accuracy, while training faster. We evaluate both models on CLEVR and CoGenT, and show that, transfer learning with fine-tuning results in a 15 point increase in accuracy, matching the state of the art. Finally, in contrast, we demonstrate that improper fine-tuning ca… ▽ More

    Submitted 16 November, 2018; v1 submitted 15 November, 2018; originally announced November 2018.

    Comments: Paper accepted for Visually Grounded Interaction and Language (ViGIL) Workshop, NIPS 2018, Montreeal, Canada

  8. arXiv:1809.11087  [pdf, other

    cs.LG cs.NE stat.ML

    Learning to Remember, Forget and Ignore using Attention Control in Memory

    Authors: T. S. Jayram, Younes Bouhadjar, Ryan L. McAvoy, Tomasz Kornuta, Alexis Asseman, Kamil Rocki, Ahmet S. Ozcan

    Abstract: Typical neural networks with external memory do not effectively separate capacity for episodic and working memory as is required for reasoning in humans. Applying knowledge gained from psychological studies, we designed a new model called Differentiable Working Memory (DWM) in order to specifically emulate human working memory. As it shows the same functional characteristics as working memory, it… ▽ More

    Submitted 28 September, 2018; originally announced September 2018.

    Comments: 20 pages

    ACM Class: I.2.6

  9. arXiv:1809.10847  [pdf, other

    cs.LG cs.NE stat.ML

    Using Multi-task and Transfer Learning to Solve Working Memory Tasks

    Authors: T. S. Jayram, Tomasz Kornuta, Ryan L. McAvoy, Ahmet S. Ozcan

    Abstract: We propose a new architecture called Memory-Augmented Encoder-Solver (MAES) that enables transfer learning to solve complex working memory tasks adapted from cognitive psychology. It uses dual recurrent neural network controllers, inside the encoder and solver, respectively, that interface with a shared memory module and is completely differentiable. We study different types of encoders in a syste… ▽ More

    Submitted 28 September, 2018; originally announced September 2018.

    Comments: 16 pages

    ACM Class: I.2.6

  10. arXiv:1801.09718  [pdf, other

    cs.CV

    Object-based reasoning in VQA

    Authors: Mikyas T. Desta, Larry Chen, Tomasz Kornuta

    Abstract: Visual Question Answering (VQA) is a novel problem domain where multi-modal inputs must be processed in order to solve the task given in the form of a natural language. As the solutions inherently require to combine visual and natural language processing with abstract reasoning, the problem is considered as AI-complete. Recent advances indicate that using high-level, abstract facts extracted from… ▽ More

    Submitted 29 January, 2018; originally announced January 2018.

    Comments: 10 pages, 15 figures, published as a conference paper at 2018 IEEE Winter Conf. on Applications of Computer Vision (WACV'2018)

  11. arXiv:1610.07675  [pdf, other

    cs.LG cs.AI cs.NE

    Surprisal-Driven Zoneout

    Authors: Kamil Rocki, Tomasz Kornuta, Tegan Maharaj

    Abstract: We propose a novel method of regularization for recurrent neural networks called suprisal-driven zoneout. In this method, states zoneout (maintain their previous value rather than updating), when the suprisal (discrepancy between the last state's prediction and target) is small. Thus regularization is adaptive and input-driven on a per-neuron basis. We demonstrate the effectiveness of this idea by… ▽ More

    Submitted 13 December, 2016; v1 submitted 24 October, 2016; originally announced October 2016.

    Comments: Published at the Continual Learning and Deep Networks Workshop; NIPS 2016

  12. arXiv:1610.06492  [pdf, other

    cs.CV cs.LG

    Utilization of Deep Reinforcement Learning for saccadic-based object visual search

    Authors: Tomasz Kornuta, Kamil Rocki

    Abstract: The paper focuses on the problem of learning saccades enabling visual object search. The developed system combines reinforcement learning with a neural network for learning to predict the possible outcomes of its actions. We validated the solution in three types of environment consisting of (pseudo)-randomly generated matrices of digits. The experimental verification is followed by the discussion… ▽ More

    Submitted 20 October, 2016; originally announced October 2016.

    Comments: Paper submitted to special session on Machine Intelligence organized during 23rd International AUTOMATION Conference