Skip to main content

Showing 1–20 of 20 results for author: Benczúr, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.05767  [pdf, other

    cs.GL cs.AI cs.LG

    Mesterséges Intelligencia Kutatások Magyarországon

    Authors: András A. Benczúr, Tibor Gyimóthy, Balázs Szegedy

    Abstract: Artificial intelligence (AI) has undergone remarkable development since the mid-2000s, particularly in the fields of machine learning and deep learning, driven by the explosive growth of large databases and computational capacity. Hungarian researchers recognized the significance of AI early on, actively participating in international research and achieving significant results in both theoretical… ▽ More

    Submitted 24 February, 2025; originally announced March 2025.

    Comments: in Hungarian language. Submitted to Magyar Tudomány

  2. arXiv:2408.15923  [pdf, other

    stat.ML cs.LG

    Generalized Naive Bayes

    Authors: Edith Alice Kovács, Anna Ország, Dániel Pfeifer, András Benczúr

    Abstract: In this paper we introduce the so-called Generalized Naive Bayes structure as an extension of the Naive Bayes structure. We give a new greedy algorithm that finds a good fitting Generalized Naive Bayes (GNB) probability distribution. We prove that this fits the data at least as well as the probability distribution determined by the classical Naive Bayes (NB). Then, under a not very restrictive con… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 44 pages, 19 figures

    MSC Class: 62C12; 62C10; 62-07

  3. arXiv:2405.10054  [pdf, other

    cs.LG eess.SY

    A finite-sample generalization bound for stable LPV systems

    Authors: Daniel Racz, Martin Gonzalez, Mihaly Petreczky, Andras Benczur, Balint Daroczy

    Abstract: One of the main theoretical challenges in learning dynamical systems from data is providing upper bounds on the generalization error, that is, the difference between the expected prediction error and the empirical prediction error measured on some finite sample. In machine learning, a popular class of such bounds are the so-called Probably Approximately Correct (PAC) bounds. In this paper, we deri… ▽ More

    Submitted 21 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 8 pages, 1 figure, under review

    MSC Class: 68 ACM Class: I.2.0

  4. arXiv:2310.09961  [pdf, other

    cs.LG stat.ME

    Theoretical Evaluation of Asymmetric Shapley Values for Root-Cause Analysis

    Authors: Domokos M. Kelen, Mihály Petreczky, Péter Kersch, András A. Benczúr

    Abstract: In this work, we examine Asymmetric Shapley Values (ASV), a variant of the popular SHAP additive local explanation method. ASV proposes a way to improve model explanations incorporating known causal relations between variables, and is also considered as a way to test for unfair discrimination in model predictions. Unexplored in previous literature, relaxing symmetry in Shapley values can have coun… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: 10 pages, 6 figures, to be published in IEEE ICDM 2023

  5. arXiv:2308.13251  [pdf, other

    math.CO cs.DM

    Constructing and sampling partite, $3$-uniform hypergraphs with given degree sequence

    Authors: Andras Hubai, Tamas Robert Mezei, Ferenc Beres, Andras Benczur, Istvan Miklos

    Abstract: Partite, $3$-uniform hypergraphs are $3$-uniform hypergraphs in which each hyperedge contains exactly one point from each of the $3$ disjoint vertex classes. We consider the degree sequence problem of partite, $3$-uniform hypergraphs, that is, to decide if such a hypergraph with prescribed degree sequences exists. We prove that this decision problem is NP-complete in general, and give a polynomial… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  6. arXiv:2306.15024  [pdf, other

    cs.CR cs.NI

    ethp2psim: Evaluating and deploying privacy-enhanced peer-to-peer routing protocols for the Ethereum network

    Authors: Ferenc Béres, István András Seres, Domokos M. Kelen, András A. Benczúr

    Abstract: Network-level privacy is the Achilles heel of financial privacy in cryptocurrencies. Financial privacy amounts to achieving and maintaining blockchain- and network-level privacy. Blockchain-level privacy recently received substantial attention. Specifically, several privacy-enhancing technologies were proposed and deployed to enhance blockchain-level privacy. On the other hand, network-level priva… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  7. arXiv:2110.13619  [pdf, other

    cs.SI cs.LG

    Vaccine skepticism detection by network embedding

    Authors: Ferenc Béres, Rita Csoma, Tamás Vilmos Michaletzky, András A. Benczúr

    Abstract: We demonstrate the applicability of network embedding to vaccine skepticism, a controversial topic of long-past history. With the Covid-19 pandemic outbreak at the end of 2019, the topic is more important than ever. Only a year after the first international cases were registered, multiple vaccines were developed and passed clinical testing. Besides the challenges of development, testing, and logis… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: The data and the source code are available on GitHub: https://github.com/ferencberes/covid-vaccine-network

    Journal ref: Extended abstract for Complex Networks 2021 (CNA21) conference

  8. arXiv:2105.15023  [pdf, other

    cs.DC

    System-aware dynamic partitioning for batch and streaming workloads

    Authors: Zoltán Zvara, Péter G. N. Szabó, Balázs Barnabás Lóránt, András A. Benczúr

    Abstract: When processing data streams with highly skewed and nonstationary key distributions, we often observe overloaded partitions when the hash partitioning fails to balance data correctly. To avoid slow tasks that delay the completion of the whole stage of computation, it is necessary to apply adaptive, on-the-fly partitioning that continuously recomputes an optimal partitioner, given the observed key… ▽ More

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: 14 pages, 8 figures

    ACM Class: C.4; C.2

  9. arXiv:2005.14051  [pdf, other

    cs.CR cs.CY

    Blockchain is Watching You: Profiling and Deanonymizing Ethereum Users

    Authors: Ferenc Béres, István András Seres, András A. Benczúr, Mikerah Quintyne-Collins

    Abstract: Ethereum is the largest public blockchain by usage. It applies an account-based model, which is inferior to Bitcoin's unspent transaction output model from a privacy perspective. Due to its privacy shortcomings, recently several privacy-enhancing overlays have been deployed on Ethereum, such as non-custodial, trustless coin mixers and confidential transactions. In our privacy analysis of Ethereum'… ▽ More

    Submitted 13 October, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: 19 pages

  10. arXiv:1912.09306  [pdf, other

    cs.LG stat.ML

    Tangent Space Separability in Feedforward Neural Networks

    Authors: Bálint Daróczy, Rita Aleksziev, András Benczúr

    Abstract: Hierarchical neural networks are exponentially more efficient than their corresponding "shallow" counterpart with the same expressive power, but involve huge number of parameters and require tedious amounts of training. By approximating the tangent subspace, we suggest a sparse representation that enables switching to shallow networks, GradNet after a very early training stage. Our experiments sho… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: 10 pages; accepted at Workshop "Beyond First-Order Optimization Methods in Machine Learning", 33rd Conference on Neural Information Processing Systems (NeurIPS 2019). arXiv admin note: substantial text overlap with arXiv:1807.06630

    MSC Class: I.2.6; I.5.1 ACM Class: I.2.6; I.5.1

  11. arXiv:1911.09432  [pdf, other

    cs.CR

    A Cryptoeconomic Traffic Analysis of Bitcoin's Lightning Network

    Authors: Ferenc Beres, Istvan Andras Seres, Andras A. Benczur

    Abstract: Lightning Network (LN) is designed to amend the scalability and privacy issues of Bitcoin. It's a payment channel network where Bitcoin transactions are issued off chain, onion routed through a private payment path with the aim to settle transactions in a faster, cheaper, and private manner, as they're not recorded in a costly-to-maintain, slow, and public ledger. In this work, we design a traffic… ▽ More

    Submitted 13 July, 2020; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: Cryptoeconomic Systems (CES) '20 Journal & Conference 7-8 March 2020, MIT, Cambridge, MA

  12. arXiv:1807.06630  [pdf, other

    cs.LG stat.ML

    Expressive power of outer product manifolds on feed-forward neural networks

    Authors: Bálint Daróczy, Rita Aleksziev, András Benczúr

    Abstract: Hierarchical neural networks are exponentially more efficient than their corresponding "shallow" counterpart with the same expressive power, but involve huge number of parameters and require tedious amounts of training. Our main idea is to mathematically understand and describe the hierarchical structure of feedforward neural networks by reparametrization invariant Riemannian metrics. By computing… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: 11 pages, 8 figures, under submission

  13. arXiv:1802.05872  [pdf, other

    cs.DC cs.LG stat.ML

    Online Machine Learning in Big Data Streams

    Authors: András A. Benczúr, Levente Kocsis, Róbert Pálovics

    Abstract: The area of online machine learning in big data streams covers algorithms that are (1) distributed and (2) work from data streams with only a limited possibility to store past data. The first requirement mostly concerns software architectures and efficient algorithms. The second one also imposes nontrivial theoretical restrictions on the modeling methods: In the data stream model, older data is no… ▽ More

    Submitted 16 February, 2018; originally announced February 2018.

  14. arXiv:1701.00406  [pdf, other

    cs.SI physics.soc-ph

    Raising Graphs From Randomness to Reveal Information Networks

    Authors: Róbert Pálovics, András A. Benczúr

    Abstract: We analyze the fine-grained connections between the average degree and the power-law degree distribution exponent in growing information networks. Our starting observation is a power-law degree distribution with a decreasing exponent and increasing average degree as a function of the network size. Our experiments are based on three Twitter at-mention networks and three more from the Koblenz Networ… ▽ More

    Submitted 2 January, 2017; originally announced January 2017.

  15. arXiv:1611.01974  [pdf, other

    cs.IR

    Item-to-item recommendation based on Contextual Fisher Information

    Authors: Bálint Daróczy, Frederick Ayala-Gómez, András Benczúr

    Abstract: Web recommendation services bear great importance in e-commerce, as they aid the user in navigating through the items that are most relevant to her needs. In a typical Web site, long history of previous activities or purchases by the user is rarely available. Hence in most cases, recommenders propose items that are similar to the most recent ones viewed in the current user session. The correspondi… ▽ More

    Submitted 8 November, 2016; v1 submitted 7 November, 2016; originally announced November 2016.

    Comments: 9 pages, 8 figures, 4 tables

  16. arXiv:1505.03002  [pdf, other

    physics.soc-ph cs.IR cs.SI

    Statistical analysis of NOMAO customer votes for spots of France

    Authors: Robert Palovics, Balint Daroczy, Andras Benczur, Julia Pap, Leonardo Ermann, Samuel Phan, Alexei D. Chepelianskii, Dima L. Shepelyansky

    Abstract: We investigate the statistical properties of votes of customers for spots of France collected by the startup company NOMAO. The frequencies of votes per spot and per customer are characterized by a power law distributions which remain stable on a time scale of a decade when the number of votes is varied by almost two orders of magnitude. Using the computer science methods we explore the spectrum a… ▽ More

    Submitted 12 May, 2015; originally announced May 2015.

    Comments: 10 pages, 12 figs

    Journal ref: Eur. Phys. J. B. v.88, p.194 (2015)

  17. arXiv:1307.7142  [pdf, other

    cs.SI physics.soc-ph

    Temporal influence over the Last.fm social network

    Authors: Róbert Pálovics, András A. Benczúr

    Abstract: Several recent results show the influence of social contacts to spread certain properties over the network, but others question the methodology of these experiments by proposing that the measured effects may be due to homophily or a shared environment. In this paper we justify the existence of the social influence by considering the temporal behavior of Last.fm users. In order to clearly distingui… ▽ More

    Submitted 28 July, 2013; originally announced July 2013.

    Comments: 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2013

  18. arXiv:1304.6601  [pdf, ps, other

    physics.soc-ph cs.IR cs.SI

    Time evolution of Wikipedia network ranking

    Authors: Young-Ho Eom, Klaus M. Frahm, András Benczúr, Dima L. Shepelyansky

    Abstract: We study the time evolution of ranking and spectral properties of the Google matrix of English Wikipedia hyperlink network during years 2003 - 2011. The statistical properties of ranking of Wikipedia articles via PageRank and CheiRank probabilities, as well as the matrix spectrum, are shown to be stabilized for 2007 - 2011. A special emphasis is done on ranking of Wikipedia personalities and unive… ▽ More

    Submitted 31 October, 2013; v1 submitted 24 April, 2013; originally announced April 2013.

    Comments: 10 pages, 11 figures. Accepted for publication in EPJB

    Journal ref: Eur. Phys. J. B. (2013) 86: 492

  19. arXiv:1006.0289  [pdf, ps, other

    cs.IR cs.AI

    Métodos para la Selección y el Ajuste de Características en el Problema de la Detección de Spam

    Authors: Carlos M. Lorenzetti, Rocío L. Cecchini, Ana G. Maguitman, András A. Benczúr

    Abstract: The email is used daily by millions of people to communicate around the globe and it is a mission-critical application for many businesses. Over the last decade, unsolicited bulk email has become a major problem for email users. An overwhelming amount of spam is flowing into users' mailboxes daily. In 2004, an estimated 62% of all email was attributed to spam. Spam is not only frustrating for most… ▽ More

    Submitted 14 October, 2010; v1 submitted 1 June, 2010; originally announced June 2010.

    Comments: 5 pages, 1 figure, Workshop de Investigadores en Ciencias de la Computación, WICC 2010, pp 48-52

    MSC Class: 68P20 ACM Class: H.3.3

    Journal ref: Workshop de Investigadores en Ciencias de la Computacion, WICC 2010, El Calafate, Santa Cruz, Argentina

  20. arXiv:cs/0207078  [pdf, ps, other

    cs.DS cs.DM

    Randomized Approximation Schemes for Cuts and Flows in Capacitated Graphs

    Authors: Andras Benczur, David R. Karger

    Abstract: We improve on random sampling techniques for approximately solving problems that involve cuts and flows in graphs. We give a near-linear-time construction that transforms any graph on n vertices into an O(n\log n)-edge graph on the same vertices whose cuts have approximately the same value as the original graph's. In this new graph, for example, we can run the O(m^{3/2})-time maximum flow algori… ▽ More

    Submitted 23 July, 2002; originally announced July 2002.

    Comments: Draft journal version combining conference publications in STOC '96 and SODA '98

    ACM Class: F.2.2; G.2.1; G.2.2