Skip to main content

Showing 1–4 of 4 results for author: Alpay, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.18129  [pdf, ps, other

    cs.CL cs.AI

    $φ^{\infty}$: Clause Purification, Embedding Realignment, and the Total Suppression of the Em Dash in Autoregressive Language Models

    Authors: Bugra Kilictas, Faruk Alpay

    Abstract: We identify a critical vulnerability in autoregressive transformer language models where the em dash token induces recursive semantic drift, leading to clause boundary hallucination and embedding space entanglement. Through formal analysis of token-level perturbations in semantic lattices, we demonstrate that em dash insertion fundamentally alters the model's latent representations, causing compou… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: 16 pages, 3 figures

    MSC Class: 68T50; 68T45; 03B70 ACM Class: I.2.6; I.2.7; I.2.3; F.4.1

  2. arXiv:2506.06870  [pdf, ps, other

    cs.LO cs.AI

    Recursive Semantic Anchoring in ISO 639:2023: A Structural Extension to ISO/TC 37 Frameworks

    Authors: Bugra Kilictas, Faruk Alpay

    Abstract: ISO 639:2023 unifies the ISO language-code family and introduces contextual metadata, but it lacks a machine-native mechanism for handling dialectal drift and creole mixtures. We propose a formalisation of recursive semantic anchoring, attaching to every language entity $χ$ a family of fixed-point operators $φ_{n,m}$ that model bounded semantic drift via the relation $φ_{n,m}(χ) = χ\oplus Δ(χ)$, w… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

    Comments: 21 pages, no figures. Includes formal proofs, RDF/Turtle ontology schema, φ-index disambiguation cases, and evaluation of transformer-based AI models under semantic drift

    MSC Class: 03B70; 18M05; 68T50 ACM Class: F.4.1; I.2.7

  3. arXiv:2505.11927  [pdf, ps, other

    cs.DS cs.IT math.NA

    XiSort: Deterministic Sorting via IEEE-754 Total Ordering and Entropy Minimization

    Authors: Faruk Alpay

    Abstract: We introduce XiSort, a deterministic and reproducible sorting algorithm for floating-point sequences based on IEEE-754 total ordering and entropy minimization. XiSort guarantees bit-for-bit stability across runs and platforms by resolving tie-breaking via information-theoretic and symbolic methods. The algorithm supports both in-memory and external (out-of-core) operation, offering consistent perf… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

    Comments: 23 pages, 1 table. Source code: https://github.com/farukalpay/XiSort. Immutable archive: https://arweave.net/Lz8tBkiFyEsq6HjJ82UO8pq4p_fyfROKbQwEkAYrOKs. No prior conference submission

    MSC Class: 68P10; 68Q25; 94A17; 65Y20 ACM Class: F.2.2; G.4; E.1; G.3

  4. arXiv:2505.09239  [pdf, ps, other

    cs.LG

    Stable and Convexified Information Bottleneck Optimization via Symbolic Continuation and Entropy-Regularized Trajectories

    Authors: Faruk Alpay

    Abstract: The Information Bottleneck (IB) method frequently suffers from unstable optimization, characterized by abrupt representation shifts near critical points of the IB trade-off parameter, beta. In this paper, I introduce a novel approach to achieve stable and convex IB optimization through symbolic continuation and entropy-regularized trajectories. I analytically prove convexity and uniqueness of the… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: 23 pages, 11 figures, includes analytical proofs, sensitivity analysis (95% CI), and JAX-based open-source implementation available at: https://github.com/farukalpay/information-bottleneck-beta-optimization

    MSC Class: 68T05; 90C25; 94A15 ACM Class: I.2.6; G.1.6; H.1.1