Showing 1–6 of 6 results for author: Horváth, M

  1. arXiv:2411.05903  [pdf]

    cs.LG cs.AI cs.CL cs.CV cs.SD eess.AS

    Towards Multi-Modal Mastery: A 4.5B Parameter Truly Multi-Modal Small Language Model

    Authors: Ben Koska, Mojmír Horváth

    Abstract: We present a novel 4.5B parameter small language model that can handle multiple input and output modalities, including text, images, videos, and audio. Despite its small size, the model achieves near state-of-the-art performance on a variety of tasks, demonstrating the potential of multi-modal models to tackle complex real-world problems. Our approach leverages recent advancements in language mode…

    Submitted 8 November, 2024; originally announced November 2024.

  2. arXiv:2205.13909  [pdf, other]

    cs.LG cs.AI cs.CR

    (De-)Randomized Smoothing for Decision Stump Ensembles

    Authors: Miklós Z. Horváth, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: Tree-based models are used in many high-stakes application domains such as finance and medicine, where robustness and interpretability are of utmost importance. Yet, methods for improving and certifying their robustness are severely under-explored, in contrast to those focusing on neural networks. Targeting this important challenge, we propose deterministic smoothing for decision stump ensembles.…

    Submitted 14 November, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022 Paper

  3. arXiv:2204.00487  [pdf, other]

    cs.LG cs.AI cs.CR

    Robust and Accurate -- Compositional Architectures for Randomized Smoothing

    Authors: Miklós Z. Horváth, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: Randomized Smoothing (RS) is considered the state-of-the-art approach to obtain certifiably robust models for challenging tasks. However, current RS approaches drastically decrease standard accuracy on unperturbed data, severely limiting their real-world utility. To address this limitation, we propose a compositional architecture, ACES, which certifiably decides on a per-sample basis whether to us…

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: Presented at the ICLR 2022 Workshop on Socially Responsible Machine Learning

  4. arXiv:2106.06946  [pdf, other]

    cs.LG cs.AI cs.CV

    Boosting Randomized Smoothing with Variance Reduced Classifiers

    Authors: Miklós Z. Horváth, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: Randomized Smoothing (RS) is a promising method for obtaining robustness certificates by evaluating a base model under noise. In this work, we: (i) theoretically motivate why ensembles are a particularly suitable choice as base models for RS, and (ii) empirically confirm this choice, obtaining state-of-the-art results in multiple settings. The key insight of our work is that the reduced variance o…

    Submitted 30 March, 2022; v1 submitted 13 June, 2021; originally announced June 2021.

    Comments: ICLR 2022 Spotlight Paper

  5. arXiv:2006.09895  [pdf, other]

    cs.DC cs.AI

    Ranking and benchmarking framework for sampling algorithms on synthetic data streams

    Authors: József Dániel Gáspár, Martin Horváth, Győző Horváth, Zoltán Zvara

    Abstract: In the fields of big data, AI, and streaming processing, we work with large amounts of data from multiple sources. Due to memory and network limitations, we process data streams on distributed systems to alleviate computational and network loads. When data streams with non-uniform distributions are processed, we often observe overloaded partitions due to the use of simple hash partitioning. To tac…

    Submitted 17 June, 2020; originally announced June 2020.

  6. arXiv:1608.06298  [pdf, other]

    cs.IR

    Infusing Collaborative Recommenders with Distributed Representations

    Authors: Greg Zanotti, Miller Horvath, Lucas Nunes Barbosa, Venkata Trinadh Kumar Gupta Immedisetty, Jonathan Gemmell

    Abstract: Recommender systems assist users in navigating complex information spaces and focus their attention on the content most relevant to their needs. Often these systems rely on user activity or descriptions of the content. Social annotation systems, in which users collaboratively assign tags to items, provide another means to capture information about users and items. Each of these data sources provid…

    Submitted 22 August, 2016; originally announced August 2016.