Showing 1–5 of 5 results for author: Constantin, C

  1. Leiden-Fusion Partitioning Method for Effective Distributed Training of Graph Embeddings

    Authors: Yuhe Bai, Camelia Constantin, Hubert Naacke

    Abstract: In the area of large-scale training of graph embeddings, effective training frameworks and partitioning methods are critical for handling large networks. However, they face two major challenges: 1) existing synchronized distributed frameworks require continuous communication to access information from other machines, and 2) the inability of current partitioning methods to ensure that subgraphs rem…

    Submitted 15 September, 2024; originally announced September 2024.

    Comments: Accepted at the 2024 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2024)

  2. arXiv:2306.02221  [pdf, other]

    cs.IR cs.AI

    ATEM: A Topic Evolution Model for the Detection of Emerging Topics in Scientific Archives

    Authors: Hamed Rahimi, Hubert Naacke, Camelia Constantin, Bernd Amann

    Abstract: This paper presents ATEM, a novel framework for studying topic evolution in scientific archives. ATEM is based on dynamic topic modeling and dynamic graph embedding techniques that explore the dynamics of content and citations of documents within a scientific corpus. ATEM explores a new notion of contextual emergence for the discovery of emerging interdisciplinary research topics based on the dyna…

    Submitted 3 June, 2023; originally announced June 2023.

  3. arXiv:2305.14587  [pdf, other]

    cs.CL cs.IR

    Contextualized Topic Coherence Metrics

    Authors: Hamed Rahimi, Jacob Louis Hoover, David Mimno, Hubert Naacke, Camelia Constantin, Bernd Amann

    Abstract: The recent explosion in work on neural topic modeling has been criticized for optimizing automated topic evaluation metrics at the expense of actual meaningful topic identification. But human annotation remains expensive and time-consuming. We propose LLM-based methods inspired by standard human topic evaluations, in a family of metrics called Contextualized Topic Coherence (CTC). We evaluate both…

    Submitted 23 May, 2023; originally announced May 2023.

  4. arXiv:2302.01501  [pdf, other]

    cs.IR cs.AI cs.LG cs.NE cs.SI

    ANTM: An Aligned Neural Topic Model for Exploring Evolving Topics

    Authors: Hamed Rahimi, Hubert Naacke, Camelia Constantin, Bernd Amann

    Abstract: This paper presents an algorithmic family of dynamic topic models called Aligned Neural Topic Models (ANTM), which combine novel data mining algorithms to provide a modular framework for discovering evolving topics. ANTM maintains the temporal continuity of evolving topics by extracting time-aware features from documents using advanced pre-trained Large Language Models (LLMs) and employing an over…

    Submitted 4 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  5. arXiv:2108.01756  [pdf, other]

    cs.LO math.CT

    Localisable Monads

    Authors: Carmen Constantin, Nuiok Dicaire, Chris Heunen

    Abstract: Monads govern computational side-effects in programming semantics. They can be combined in a "bottom-up" way to handle several instances of such effects. Indexed monads and graded monads do this in a modular way. Here, instead, we equip monads with fine-grained structure in a "top-down" way, using techniques from tensor topology. This provides an intrinsic theory of local computational effects…

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: 24 pages, 1 figure