Skip to main content

Showing 1–3 of 3 results for author: Gerlach, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2106.15821  [pdf, other

    cs.SI physics.soc-ph stat.ML

    Multilayer Networks for Text Analysis with Multiple Data Types

    Authors: Charles C. Hyland, Yuanming Tao, Lamiae Azizi, Martin Gerlach, Tiago P. Peixoto, Eduardo G. Altmann

    Abstract: We are interested in the widespread problem of clustering documents and finding topics in large collections of written documents in the presence of metadata and hyperlinks. To tackle the challenge of accounting for these different types of datasets, we propose a novel framework based on Multilayer Networks and Stochastic Block Models. The main innovation of our approach over other techniques is th… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: 17 pages, 6 figures

    Journal ref: EPJ Data Science volume 10, Article number: 33 (2021)

  2. arXiv:1801.10108  [pdf, ps, other

    stat.ML math.AP math.DG math.PR

    Error estimates for spectral convergence of the graph Laplacian on random geometric graphs towards the Laplace--Beltrami operator

    Authors: Nicolas Garcia Trillos, Moritz Gerlach, Matthias Hein, Dejan Slepcev

    Abstract: We study the convergence of the graph Laplacian of a random geometric graph generated by an i.i.d. sample from a $m$-dimensional submanifold $M$ in $R^d$ as the sample size $n$ increases and the neighborhood size $h$ tends to zero. We show that eigenvalues and eigenvectors of the graph Laplacian converge with a rate of $O\Big(\big(\frac{\log n}{n}\big)^\frac{1}{2m}\Big)$ to the eigenvalues and eig… ▽ More

    Submitted 30 January, 2018; originally announced January 2018.

    MSC Class: 62G20; 65N25; 60D05; 58J50; 68R10; 05C50

  3. arXiv:1708.01677  [pdf, other

    stat.ML cs.CL physics.data-an physics.soc-ph

    A network approach to topic models

    Authors: Martin Gerlach, Tiago P. Peixoto, Eduardo G. Altmann

    Abstract: One of the main computational and scientific challenges in the modern age is to extract useful information from unstructured texts. Topic models are one popular machine-learning approach which infers the latent topical structure of a collection of documents. Despite their success --- in particular of its most widely used variant called Latent Dirichlet Allocation (LDA) --- and numerous application… ▽ More

    Submitted 19 July, 2018; v1 submitted 4 August, 2017; originally announced August 2017.

    Comments: 22 pages, 10 figures, code available at https://topsbm.github.io/

    Journal ref: Science Advances 4, eaaq1360 (2018)