Skip to main content

Showing 1–5 of 5 results for author: Thompson, S M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.09772  [pdf, ps, other

    cs.LO cs.FL

    Characterization and Decidability of FC-Definable Regular Languages

    Authors: Sam M. Thompson, Nicole Schweikardt, Dominik D. Freydenberger

    Abstract: FC is a first-order logic that reasons over all factors of a finite word using concatenation, and can define non-regular languages like that of all squares (ww). In this paper, we establish that there are regular languages that are not FC-definable. Moreover, we give a decidable characterization of the FC-definable regular languages in terms of algebra, automata, and regular expressions. The latte… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: To appear in LICS 2025

  2. arXiv:2306.16364  [pdf, ps, other

    cs.LO cs.DB cs.FL

    Generalized Core Spanner Inexpressibility via Ehrenfeucht-Fraïssé Games for FC

    Authors: Sam M. Thompson, Dominik D. Freydenberger

    Abstract: Despite considerable research on document spanners, little is known about the expressive power of generalized core spanners. In this paper, we use Ehrenfeucht-Fraïssé games to obtain general inexpressibility lemmas for the logic FC (a finite-model variant of the theory of concatenation). Applying these lemmas give inexpressibility results for FC that we lift to generalized core spanners. In partic… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  3. arXiv:2208.01298  [pdf, ps, other

    cs.LO cs.DB cs.FL

    Conjunctive Queries for Logic-Based Information Extraction

    Authors: Sam M. Thompson

    Abstract: This thesis offers two logic-based approaches to conjunctive queries in the context of information extraction. The first and main approach is the introduction of conjunctive query fragments of the logics FC and FC[REG], denoted as FC-CQ and FC[REG]-CQ respectively. FC is a first-order logic based on word equations, where the semantics are defined by limiting the universe to the factors of some fin… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: Based on the author's PhD thesis and contains work from two conference publications (arXiv:2104.04758, arXiv:1909.10869) which are joint work with Dominik D. Freydenberger

  4. arXiv:2104.04758  [pdf, other

    cs.DB cs.LO

    Splitting Spanner Atoms: A Tool for Acyclic Core Spanners

    Authors: Dominik D. Freydenberger, Sam M. Thompson

    Abstract: This paper investigates regex CQs with string equalities (SERCQs), a subclass of core spanners. As shown by Freydenberger, Kimelfeld, and Peterfreund (PODS 2018), these queries are intractable, even if restricted to acyclic queries. This previous result defines acyclicity by treating regex formulas as atoms. In contrast to this, we propose an alternative definition by converting SERCQs into FC-CQs… ▽ More

    Submitted 19 January, 2022; v1 submitted 10 April, 2021; originally announced April 2021.

  5. arXiv:1909.10869  [pdf, other

    cs.LO

    Dynamic Complexity of Document Spanners

    Authors: Dominik D. Freydenberger, Sam M. Thompson

    Abstract: The present paper investigates the dynamic complexity of document spanners, a formal framework for information extraction introduced by Fagin, Kimelfeld, Reiss, and Vansummeren (JACM 2015). We first look at the class of regular spanners and prove that any regular spanner can be maintained in the dynamic complexity class DynPROP. This result follows from work done previously on the dynamic complexi… ▽ More

    Submitted 9 January, 2020; v1 submitted 24 September, 2019; originally announced September 2019.