Skip to main content

Showing 1–33 of 33 results for author: Tsur, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.06409  [pdf, other

    cs.CR cs.AI cs.CL cs.CY cs.IT cs.LG

    HeavyWater and SimplexWater: Watermarking Low-Entropy Text Distributions

    Authors: Dor Tsur, Carol Xuan Long, Claudio Mayrink Verdun, Hsiang Hsu, Chen-Fu Chen, Haim Permuter, Sajani Vithana, Flavio P. Calmon

    Abstract: Large language model (LLM) watermarks enable authentication of text provenance, curb misuse of machine-generated text, and promote trust in AI systems. Current watermarks operate by changing the next-token predictions output by an LLM. The updated (i.e., watermarked) predictions depend on random side information produced, for example, by hashing previously generated tokens. LLM watermarking is par… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2506.00573  [pdf, other

    cs.LG stat.ML

    Neural Estimation for Scaling Entropic Multimarginal Optimal Transport

    Authors: Dor Tsur, Ziv Goldfeld, Kristjan Greenewald, Haim Permuter

    Abstract: Multimarginal optimal transport (MOT) is a powerful framework for modeling interactions between multiple distributions, yet its applicability is bottlenecked by a high computational overhead. Entropic regularization provides computational speedups via the multimarginal Sinkhorn algorithm, whose time complexity, for a dataset size $n$ and $k$ marginals, generally scales as $O(n^k)$. However, this d… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  3. arXiv:2505.08878  [pdf, other

    cs.CR cs.AI cs.IT

    Optimized Couplings for Watermarking Large Language Models

    Authors: Dor Tsur, Carol Xuan Long, Claudio Mayrink Verdun, Hsiang Hsu, Haim Permuter, Flavio P. Calmon

    Abstract: Large-language models (LLMs) are now able to produce text that is, in many cases, seemingly indistinguishable from human-generated content. This has fueled the development of watermarks that imprint a ``signal'' in LLM-generated text with minimal perturbation of an LLM's output. This paper provides an analysis of text watermarking in a one-shot setting. Through the lens of hypothesis testing with… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: Accepted at ISIT25

  4. arXiv:2502.19983  [pdf, other

    cs.LG

    Efficient Time Series Forecasting via Hyper-Complex Models and Frequency Aggregation

    Authors: Eyal Yakir, Dor Tsur, Haim Permuter

    Abstract: Time series forecasting is a long-standing problem in statistics and machine learning. One of the key challenges is processing sequences with long-range dependencies. To that end, a recent line of work applied the short-time Fourier transform (STFT), which partitions the sequence into multiple subsequences and applies a Fourier transform to each separately. We propose the Frequency Information Agg… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: 12 pages, 5 figures. Still awaiting conference submission approval

    MSC Class: 62M20; 42A16; 68T05; 15A66 62M20; 42A16; 68T05; 15A66 62M20; 42A16; 68T05; 15A66 ACM Class: I.2.6; I.5.1

  5. arXiv:2501.06452  [pdf, other

    cs.DS

    Faster parameterized algorithm for 3-Hitting Set

    Authors: Dekel Tsur

    Abstract: In the 3-Hitting Set problem, the input is a hypergraph $G$ such that the size of every hyperedge of $G$ is at most 3, and an integers $k$, and the goal is to decide whether there is a set $S$ of at most $k$ vertices such that every hyperedge of $G$ contains at least one vertex from $S$. In this paper we give an $O^*(2.0409^k)$-time algorithm for 3-Hitting Set.

    Submitted 11 January, 2025; originally announced January 2025.

  6. arXiv:2405.16463  [pdf, ps, other

    cs.IT

    InfoMat: A Tool for the Analysis and Visualization Sequential Information Transfer

    Authors: Dor Tsur, Haim Permuter

    Abstract: Despite the popularity of information measures in analysis of probabilistic systems, proper tools for their visualization are not common. This work develops a simple matrix representation of information transfer in sequential systems, termed information matrix (InfoMat). The simplicity of the InfoMat provides a new visual perspective on existing decomposition formulas of mutual information, and en… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted at ISIT2024

  7. arXiv:2402.06919  [pdf, other

    cs.IT cs.LG

    TREET: TRansfer Entropy Estimation via Transformers

    Authors: Omer Luxembourg, Dor Tsur, Haim Permuter

    Abstract: Transfer entropy (TE) is an information theoretic measure that reveals the directional flow of information between processes, providing valuable insights for a wide range of real-world applications. This work proposes Transfer Entropy Estimation via Transformers (TREET), a novel attention-based approach for estimating TE for stationary processes. The proposed approach employs Donsker-Varadhan repr… ▽ More

    Submitted 14 May, 2025; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  8. arXiv:2309.16200  [pdf, other

    cs.LG cs.IT

    Max-Sliced Mutual Information

    Authors: Dor Tsur, Ziv Goldfeld, Kristjan Greenewald

    Abstract: Quantifying the dependence between high-dimensional random variables is central to statistical learning and inference. Two classical methods are canonical correlation analysis (CCA), which identifies maximally correlated projected versions of the original variables, and Shannon's mutual information, which is a universal dependence measure that also captures high-order dependencies. However, CCA on… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: Accepted at NeurIPS 2023

  9. arXiv:2301.00621  [pdf, ps, other

    cs.IT cs.LG

    Data-Driven Optimization of Directed Information over Discrete Alphabets

    Authors: Dor Tsur, Ziv Aharoni, Ziv Goldfeld, Haim Permuter

    Abstract: Directed information (DI) is a fundamental measure for the study and analysis of sequential stochastic models. In particular, when optimized over input distributions it characterizes the capacity of general communication channels. However, analytic computation of DI is typically intractable and existing optimization techniques over discrete input alphabets require knowledge of the channel model, w… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

  10. arXiv:2203.14743  [pdf, ps, other

    cs.IT

    Neural Estimation and Optimization of Directed Information over Continuous Spaces

    Authors: Dor Tsur, Ziv Aharoni, Ziv Goldfeld, Haim Permuter

    Abstract: This work develops a new method for estimating and optimizing the directed information rate between two jointly stationary and ergodic stochastic processes. Building upon recent advances in machine learning, we propose a recurrent neural network (RNN)-based estimator which is optimized via gradient ascent over the RNN parameters. The estimator does not require prior knowledge of the underlying joi… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 38 pages, 6 figures

  11. arXiv:2003.04179  [pdf, ps, other

    cs.IT cs.LG

    Capacity of Continuous Channels with Memory via Directed Information Neural Estimator

    Authors: Ziv Aharoni, Dor Tsur, Ziv Goldfeld, Haim Henry Permuter

    Abstract: Calculating the capacity (with or without feedback) of channels with memory and continuous alphabets is a challenging task. It requires optimizing the directed information (DI) rate over all channel input distributions. The objective is a multi-letter expression, whose analytic solution is only known for a few specific cases. When no analytic solution is present or the channel model is unknown, th… ▽ More

    Submitted 16 May, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

  12. arXiv:1910.07944  [pdf, ps, other

    cs.DS

    Faster parameterized algorithm for Bicluter Editing

    Authors: Dekel Tsur

    Abstract: In the Bicluter Editing problem the input is a graph $G$ and an integer $k$, and the goal is to decide whether $G$ can be transformed into a bicluster graph by adding and removing at most $k$ edges. In this paper we give an algorithm for Bicluster Editing whose running time is $O^*(3.116^k)$.

    Submitted 17 October, 2019; originally announced October 2019.

  13. arXiv:1908.07318  [pdf, ps, other

    cs.DS

    An algorithm for destroying claws and diamonds

    Authors: Dekel Tsur

    Abstract: In the {Claw,Diamond}-Free Edge Deletion problem the input is a graph $G$ and an integer $k$, and the goal is to decide whether there is a set of edges of size at most $k$ such that removing the edges of the set from $G$ results a graph that does not contain an induced claw or diamond. In this paper we give an algorithm for this problem whose running time is $O^*(3.562^k)$.

    Submitted 20 August, 2019; originally announced August 2019.

  14. arXiv:1908.03600  [pdf, ps, other

    cs.DS

    Kernel for Kt-free edge deletion

    Authors: Dekel Tsur

    Abstract: In the $K_t$-free edge deletion problem, the input is a graph $G$ and an integer $k$, and the goal is to decide whether there is a set of at most $k$ edges of $G$ whose removal results a graph with no clique of size $t$. In this paper we give a kernel to this problem with $O(k^{t-1})$ vertices and edges.

    Submitted 9 August, 2019; originally announced August 2019.

  15. arXiv:1908.01223  [pdf, other

    cs.DS cs.DM math.CO

    Faster algorithms for cograph edge modification problems

    Authors: Dekel Tsur

    Abstract: In the Cograph Deletion (resp., Cograph Editing) problem the input is a graph $G$ and an integer $k$, and the goal is to decide whether there is a set of edges of size at most $k$ whose removal from $G$ (resp., removal and addition to $G$) results in a graph that does not contain an induced path with four vertices. In this paper we give algorithms for Cograph Deletion and Cograph Editing whose run… ▽ More

    Submitted 30 December, 2019; v1 submitted 3 August, 2019; originally announced August 2019.

  16. arXiv:1907.10230  [pdf, other

    cs.DS

    An FPT algorithm for orthogonal buttons and scissors

    Authors: Dekel Tsur

    Abstract: We study the puzzle game Buttons and Scissors in which the goal is to remove all buttons from an $n\times m$ grid by a series of horizontal and vertical cuts. We show that the corresponding parameterized problem has an algorithm with time complexity $2^{O(k^2 \log k)} (n+m)^{O(1)}$, where $k$ is an upper bound on the number of cuts.

    Submitted 24 July, 2019; originally announced July 2019.

  17. arXiv:1907.08399  [pdf, ps, other

    cs.DS

    Cluster deletion revisited

    Authors: Dekel Tsur

    Abstract: In the Cluster Deletion problem the input is a graph $G$ and an integer $k$, and the goal is to decide whether there is a set of at most $k$ edges whose removal from $G$ results a graph in which every connected component is a clique. In this paper we give an algorithm for Cluster Deletion whose running time is $O^*(1.404^k)$.

    Submitted 19 July, 2019; originally announced July 2019.

  18. arXiv:1906.10523  [pdf, other

    cs.DS

    l-path vertex cover is easier than l-hitting set for small l

    Authors: Dekel Tsur

    Abstract: In the $l$-path vertex cover problem the input is an undirected graph $G$ and an integer $k$. The goal is to decide whether there is a set of vertices $S$ of size at most $k$ such that $G-S$ does not contain a path with $l$ vertices. In this paper we give parameterized algorithms for $l$-path vertex cover for $l = 5,6,7$, whose time complexities are $O^*(3.945^k)$, $O^*(4.947^k)$, and… ▽ More

    Submitted 22 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: text overlap with arXiv:1901.07609

  19. arXiv:1906.10012  [pdf, ps, other

    cs.DS

    Algorithms for deletion problems on split graphs

    Authors: Dekel Tsur

    Abstract: In the Split to Block Vertex Deletion and Split to Threshold Vertex Deletion problems the input is a split graph $G$ and an integer $k$, and the goal is to decide whether there is a set $S$ of at most $k$ vertices such that $G-S$ is a block graph and $G-S$ is a threshold graph, respectively. In this paper we give algorithms for these problems whose running times are $O^*(2.076^k)$ and… ▽ More

    Submitted 25 July, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

  20. arXiv:1901.07609  [pdf, other

    cs.DS

    Faster parameterized algorithm for Cluster Vertex Deletion

    Authors: Dekel Tsur

    Abstract: In the Cluster Vertex Deletion problem the input is a graph $G$ and an integer $k$. The goal is to decide whether there is a set of vertices $S$ of size at most $k$ such that the deletion of the vertices of $S$ from $G$ results a graph in which every connected component is a clique. We give an algorithm for Cluster Vertex Deletion whose running time is $O^*(1.811^k)$.

    Submitted 22 January, 2019; originally announced January 2019.

  21. arXiv:1901.02491  [pdf, ps, other

    cs.DS

    Faster parameterized algorithm for pumpkin vertex deletion set

    Authors: Dekel Tsur

    Abstract: A directed graph $G$ is called a pumpkin if $G$ is a union of induced paths with a common start vertex $s$ and a common end vertex $t$, and the internal vertices of every two paths are disjoint. We give an algorithm that given a directed graph $G$ and an integer $k$, decides whether a pumpkin can be obtained from $G$ by deleting at most $k$ vertices. The algorithm runs in $O^*(2^k)$ time.

    Submitted 8 January, 2019; originally announced January 2019.

  22. arXiv:1812.10808  [pdf, ps, other

    cs.DS

    Above guarantee parameterization for vertex cover on graphs with maximum degree 4

    Authors: Dekel Tsur

    Abstract: In the vertex cover problem, the input is a graph $G$ and an integer $k$, and the goal is to decide whether there is a set of vertices $S$ of size at most $k$ such that every edge of $G$ is incident on at least one vertex in $S$. We study the vertex cover problem on graphs with maximum degree 4 and minimum degree at least 2, parameterized by $r = k-n/3$. We give an algorithm for this problem whose… ▽ More

    Submitted 27 December, 2018; originally announced December 2018.

  23. arXiv:1811.03592  [pdf, other

    cs.DS

    An O^*(2.619^k) algorithm for 4-path vertex cover

    Authors: Dekel Tsur

    Abstract: In the 4-path vertex cover problem, the input is an undirected graph $G$ and an integer $k$. The goal is to decide whether there is a set of vertices $S$ of size at most $k$ such that every path with 4 vertices in $G$ contains at least one vertex of $S$. In this paper we give a parameterized algorithm for 4-path vertex cover whose time complexity is $O^*(2.619^k)$.

    Submitted 5 January, 2019; v1 submitted 8 November, 2018; originally announced November 2018.

  24. arXiv:1810.12982  [pdf, ps, other

    cs.DS

    Weighted vertex cover on graphs with maximum degree 3

    Authors: Dekel Tsur

    Abstract: We give a parameterized algorithm for weighted vertex cover on graphs with maximum degree 3 whose time complexity is $O^*(1.402^t)$, where $t$ is the minimum size of a vertex cover of the input graph.

    Submitted 30 October, 2018; originally announced October 2018.

  25. arXiv:1809.02636  [pdf, other

    cs.DS

    Parameterized algorithm for 3-path vertex cover

    Authors: Dekel Tsur

    Abstract: In the 3-path vertex cover problem, the input is an undirected graph $G$ and an integer $k$. The goal is to decide whether there is a set of vertices $S$ of size at most $k$ such that every path with 3 vertices in $G$ contains at least one vertex of $S$. In this paper we give parameterized algorithm for 3-path cover whose time complexity is $O^*(1.713^k)$. Our algorithm is faster than previous alg… ▽ More

    Submitted 7 September, 2018; originally announced September 2018.

  26. arXiv:1808.04185  [pdf, ps, other

    cs.DS

    Faster deterministic parameterized algorithm for k-Path

    Authors: Dekel Tsur

    Abstract: In the k-Path problem, the input is a directed graph $G$ and an integer $k\geq 1$, and the goal is to decide whether there is a simple directed path in $G$ with exactly $k$ vertices. We give a deterministic algorithm for k-Path with time complexity $O^*(2.554^k)$. This improves the previously best deterministic algorithm for this problem of Zehavi [ESA 2015] whose time complexity is… ▽ More

    Submitted 24 January, 2019; v1 submitted 13 August, 2018; originally announced August 2018.

  27. arXiv:1808.03658  [pdf, other

    cs.DS

    The effective entropy of next/previous larger/smaller value queries

    Authors: Dekel Tsur

    Abstract: We study the problem of storing the minimum number of bits required to answer next/previous larger/smaller value queries on an array $A$ of $n$ numbers, without storing $A$. We show that these queries can be answered by storing at most $3.701 n$ bits. Our result improves the result of Jo and Satti [TCS 2016] that gives an upper bound of $4.088n$ bits for this problem.

    Submitted 10 August, 2018; originally announced August 2018.

  28. arXiv:1808.03553  [pdf, ps, other

    cs.DS

    Dynamic all scores matrices for LCS score

    Authors: Amir Carmel, Dekel Tsur, Michal Ziv-Ukelson

    Abstract: The problem of aligning two strings A,B in order to determine their similarity is fundamental in the field of pattern matching. An important concept in this domain is the "all scores matrix" that encodes the local alignment comparison of two strings. Namely, let K denote the all scores matrix containing the alignment score of every substring of B with A, and let J denote the all scores matrix cont… ▽ More

    Submitted 10 August, 2018; originally announced August 2018.

  29. arXiv:1807.00371  [pdf, ps, other

    cs.DS

    Representation of ordered trees with a given degree distribution

    Authors: Dekel Tsur

    Abstract: The degree distribution of an ordered tree $T$ with $n$ nodes is $\vec{n} = (n_0,\ldots,n_{n-1})$, where $n_i$ is the number of nodes in $T$ with $i$ children. Let $\mathcal{N}(\vec{n})$ be the number of trees with degree distribution $\vec{n}$. We give a data structure that stores an ordered tree $T$ with $n$ nodes and degree distribution $\vec{n}$ using $\log \mathcal{N}(\vec{n})+O(n/\log^t n)$… ▽ More

    Submitted 1 July, 2018; originally announced July 2018.

  30. arXiv:1805.11255  [pdf, ps, other

    cs.DS

    Succinct data structure for dynamic trees with faster queries

    Authors: Dekel Tsur

    Abstract: Navarro and Sadakane [TALG 2014] gave a dynamic succinct data structure for storing an ordinal tree. The structure supports tree queries in either $O(\log n/\log\log n)$ or $O(\log n)$ time, and insertion or deletion of a single node in $O(\log n)$ time. In this paper we improve the result of Navarro and Sadakane by reducing the time complexities of some queries (e.g.\ degree and level\_ancestor)… ▽ More

    Submitted 29 May, 2018; originally announced May 2018.

  31. arXiv:1609.01400  [pdf, ps, other

    cs.DS

    Succinct data-structure for nearest colored node in a tree

    Authors: Dekel Tsur

    Abstract: We give a succinct data-structure that stores a tree with colors on the nodes. Given a node x and a color alpha, the structure finds the nearest node to x with color alpha. This results improves the $O(n\log n)$-bits structure of Gawrychowski et al.~[CPM 2016].

    Submitted 18 February, 2017; v1 submitted 6 September, 2016; originally announced September 2016.

  32. arXiv:1312.6039  [pdf, other

    cs.DS

    Succinct representation of labeled trees

    Authors: Dekel Tsur

    Abstract: We give a representation for labeled ordered trees that supports labeled queries such as finding the i-th ancestor of a node with a given label. Our representation is succinct, namely the redundancy is small-o of the optimal space for storing the tree. This improves the representation of He et al. which is succinct unless the entropy of the labels is small.

    Submitted 20 December, 2013; originally announced December 2013.

  33. arXiv:1310.1440  [pdf, other

    cs.DS

    Approximate String Matching using a Bidirectional Index

    Authors: Gregory Kucherov, Kamil Salikhov, Dekel Tsur

    Abstract: We study strategies of approximate pattern matching that exploit bidirectional text indexes, extending and generalizing ideas of Lam et al. We introduce a formalism, called search schemes, to specify search strategies of this type, then develop a probabilistic measure for the efficiency of a search scheme, prove several combinatorial results on efficient search schemes, and finally, provide experi… ▽ More

    Submitted 6 September, 2015; v1 submitted 5 October, 2013; originally announced October 2013.