
Showing 1–3 of 3 results for author: Tyrolski, M

  1. arXiv:2406.03361

    cs.LG

    What Matters in Hierarchical Search for Combinatorial Reasoning Problems?

    Authors: Michał Zawalski, Gracjan Góral, Michał Tyrolski, Emilia Wiśnios, Franciszek Budrowski, Marek Cygan, Łukasz Kuciński, Piotr Miłoś

    Abstract: Efficiently tackling combinatorial reasoning problems, particularly the notorious NP-hard tasks, remains a significant challenge for AI research. Recent efforts have sought to enhance planning by incorporating hierarchical high-level search strategies, known as subgoal methods. While promising, their performance against traditional low-level planners is inconsistent, raising questions about their…

    Submitted 11 February, 2025; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted for the Generative Models for Decision Making Workshop at ICLR 2024

  2. arXiv:2206.00702

    cs.AI cs.LG

    Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search

    Authors: Michał Zawalski, Michał Tyrolski, Konrad Czechowski, Tomasz Odrzygóźdź, Damian Stachura, Piotr Piękos, Yuhuai Wu, Łukasz Kuciński, Piotr Miłoś

    Abstract: Complex reasoning problems contain states that vary in the computational cost required to determine a good action plan. Taking advantage of this property, we propose Adaptive Subgoal Search (AdaSubS), a search method that adaptively adjusts the planning horizon. To this end, AdaSubS generates diverse sets of subgoals at different distances. A verification mechanism is employed to filter out unreac…

    Submitted 25 May, 2024; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: ICLR 2023 (notable-top-5%) website: https://sites.google.com/view/adaptivesubgoalsearch/

    ACM Class: I.2.8; I.2.6

  3. arXiv:2110.13711

    cs.LG cs.CL

    Hierarchical Transformers Are More Efficient Language Models

    Authors: Piotr Nawrot, Szymon Tworkowski, Michał Tyrolski, Łukasz Kaiser, Yuhuai Wu, Christian Szegedy, Henryk Michalewski

    Abstract: Transformer models yield impressive results on many NLP and sequence modeling tasks. Remarkably, Transformers can handle long sequences which allows them to produce long coherent outputs: full paragraphs produced by GPT-3 or well-structured images produced by DALL-E. These large language models are impressive but also very inefficient and costly, which limits their applications and accessibility.…

    Submitted 16 April, 2022; v1 submitted 26 October, 2021; originally announced October 2021.