
Showing 1–10 of 10 results for author: Iklassov, Z

  1. arXiv:2505.21887  [pdf, ps, other]

    cs.AI cs.CE cs.LG

    SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem

    Authors: Ahmed Heakl, Yahia Salaheldin Shaaban, Martin Takac, Salem Lahlou, Zangir Iklassov

    Abstract: Robust routing under uncertainty is central to real-world logistics, yet most benchmarks assume static, idealized settings. We present SVRPBench, the first open benchmark to capture high-fidelity stochastic dynamics in vehicle routing at urban scale. Spanning more than 500 instances with up to 1000 customers, it simulates realistic delivery conditions: time-dependent congestion, log-normal delays,…

    Submitted 29 May, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

    Comments: 18 pages, 14 figures, 11 tables

  2. arXiv:2505.12135  [pdf, other]

    cs.AI cs.CL

    LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs

    Authors: Omar Choukrani, Idriss Malek, Daniil Orel, Zhuohan Xie, Zangir Iklassov, Martin Takáč, Salem Lahlou

    Abstract: Assessing the capacity of Large Language Models (LLMs) to plan and reason within the constraints of interactive environments is crucial for developing capable AI agents. We introduce LLM-BabyBench, a new benchmark suite designed specifically for this purpose. Built upon a textual adaptation of the procedurally generated BabyAI grid world, this suite evaluates LLMs on three fundamental a…

    Submitted 17 May, 2025; originally announced May 2025.

  3. arXiv:2501.13491  [pdf, other]

    cs.CL cs.AI

    RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles

    Authors: Munachiso Nwadike, Zangir Iklassov, Toluwani Aremu, Tatsuya Hiraoka, Velibor Bojkovic, Benjamin Heinzerling, Hilal Alqaubeh, Martin Takáč, Kentaro Inui

    Abstract: We introduce the concept of the self-referencing causal cycle (abbreviated RECALL) - a mechanism that enables large language models (LLMs) to bypass the limitations of unidirectional causality, which underlies a phenomenon known as the reversal curse. When an LLM is prompted with sequential data, it often fails to recall preceding context. For example, when we ask an LLM to recall the line precedi…

    Submitted 23 January, 2025; originally announced January 2025.

  4. arXiv:2412.16188  [pdf, other]

    cs.LG cs.AI

    A Decade of Deep Learning: A Survey on The Magnificent Seven

    Authors: Dilshod Azizov, Muhammad Arslan Manzoor, Velibor Bojkovic, Yingxu Wang, Zixiao Wang, Zangir Iklassov, Kailong Zhao, Liang Li, Siwei Liu, Yu Zhong, Wei Liu, Shangsong Liang

    Abstract: Deep learning has fundamentally reshaped the landscape of artificial intelligence over the past decade, enabling remarkable achievements across diverse domains. At the heart of these developments lie multi-layered neural network architectures that excel at automatic feature extraction, leading to significant improvements in machine learning tasks. To demystify these advances and offer accessible g…

    Submitted 13 December, 2024; originally announced December 2024.

  5. arXiv:2405.17950  [pdf, other]

    cs.AI

    Self-Guiding Exploration for Combinatorial Problems

    Authors: Zangir Iklassov, Yali Du, Farkhad Akimov, Martin Takac

    Abstract: Large Language Models (LLMs) have become pivotal in addressing reasoning tasks across diverse domains, including arithmetic, commonsense, and symbolic reasoning. They utilize prompting techniques such as Exploration-of-Thought, Decomposition, and Refinement to effectively navigate and solve intricate tasks. Despite these advancements, the application of LLMs to Combinatorial Problems (CPs), known…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 22 pages

  6. arXiv:2402.09765  [pdf, other]

    cs.AI

    Reinforcement Learning for Solving Stochastic Vehicle Routing Problem with Time Windows

    Authors: Zangir Iklassov, Ikboljon Sobirov, Ruben Solozabal, Martin Takac

    Abstract: This paper introduces a reinforcement learning approach to optimize the Stochastic Vehicle Routing Problem with Time Windows (SVRP), focusing on reducing travel costs in goods delivery. We develop a novel SVRP formulation that accounts for uncertain travel costs and demands, alongside specific customer time windows. An attention-based neural network trained through reinforcement learning is employ…

    Submitted 15 February, 2024; originally announced February 2024.

  7. arXiv:2311.07708  [pdf, other]

    cs.AI cs.CE cs.LG

    Reinforcement Learning for Solving Stochastic Vehicle Routing Problem

    Authors: Zangir Iklassov, Ikboljon Sobirov, Ruben Solozabal, Martin Takac

    Abstract: This study addresses a gap in the utilization of Reinforcement Learning (RL) and Machine Learning (ML) techniques in solving the Stochastic Vehicle Routing Problem (SVRP) that involves the challenging task of optimizing vehicle routes under uncertain conditions. We propose a novel end-to-end framework that comprehensively addresses the key sources of stochasticity in SVRP and utilizes an RL agent…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 14 pages, accepted to ACML24

  8. arXiv:2206.04423  [pdf, other]

    cs.LG cs.AI

    Learning to generalize Dispatching rules on the Job Shop Scheduling

    Authors: Zangir Iklassov, Dmitrii Medvedev, Ruben Solozabal, Martin Takac

    Abstract: This paper introduces a Reinforcement Learning approach to better generalize heuristic dispatching rules on the Job-shop Scheduling Problem (JSP). Current models on the JSP do not focus on generalization, although, as we show in this work, this is key to learning better heuristics on the problem. A well-known technique to improve generalization is to learn on increasingly complex instances using C…

    Submitted 15 November, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

  9. arXiv:2205.13189  [pdf]

    cs.LG cs.AI cs.CV

    AI for Porosity and Permeability Prediction from Geologic Core X-Ray Micro-Tomography

    Authors: Zangir Iklassov, Dmitrii Medvedev, Otabek Nazarov, Shakhboz Razzokov

    Abstract: Geologic cores are rock samples that are extracted from deep under the ground during the well drilling process. They are used for petroleum reservoirs' performance characterization. Traditionally, physical studies of cores are carried out by the means of manual time-consuming experiments. With the development of deep learning, scientists actively started working on developing machine-learning-base…

    Submitted 28 November, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

  10. arXiv:2205.12888  [pdf, other]

    cs.LG cs.AI

    Robust Reinforcement Learning on Graphs for Logistics optimization

    Authors: Zangir Iklassov, Dmitrii Medvedev

    Abstract: Logistics optimization nowadays is becoming one of the hottest areas in the AI community. In the past year, significant advancements in the domain were achieved by representing the problem in a form of graph. Another promising area of research was to apply reinforcement learning algorithms to the above task. In our work, we made advantage of using both approaches and apply reinforcement learning o…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: Keywords: Graph Neural Network (GNN), Logistics optimization, Reinforcement Learning