Skip to main content

Showing 1–7 of 7 results for author: Rahmann, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.05847  [pdf, other

    cs.DS cs.DB

    Smaller and More Flexible Cuckoo Filters

    Authors: Johanna Elena Schmitz, Jens Zentgraf, Sven Rahmann

    Abstract: Cuckoo filters are space-efficient approximate set membership data structures with a controllable false positive rate (FPR) and zero false negatives, similar to Bloom filters. In contrast to Bloom filters, Cuckoo filters store multi-bit fingerprints of keys in a hash table using variants of Cuckoo hashing, allowing each fingerprint to be stored at a small number of possible locations. Existing Cuc… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

  2. arXiv:2501.18977  [pdf, other

    cs.DB cs.DS

    Blocked Bloom Filters with Choices

    Authors: Johanna Elena Schmitz, Jens Zentgraf, Sven Rahmann

    Abstract: Probabilistic filters are approximate set membership data structures that represent a set of keys in small space, and answer set membership queries without false negative answers, but with a certain allowed false positive probability. Such filters are widely used in database systems, networks, storage systems and in biological sequence analysis because of their fast query times and low space requi… ▽ More

    Submitted 9 May, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

  3. arXiv:1702.01703  [pdf, other

    q-bio.GN cs.DS

    Variant tolerant read mapping using min-hashing

    Authors: Jens Quedenfeld, Sven Rahmann

    Abstract: DNA read mapping is a ubiquitous task in bioinformatics, and many tools have been developed to solve the read mapping problem. However, there are two trends that are changing the landscape of readmapping: First, new sequencing technologies provide very long reads with high error rates (up to 15%). Second, many genetic variants in the population are known, so the reference genome is not considered… ▽ More

    Submitted 8 February, 2017; v1 submitted 6 February, 2017; originally announced February 2017.

    ACM Class: F.2.2

  4. arXiv:1405.5501  [pdf, other

    cs.OH

    Using the Expectation Maximization Algorithm with Heterogeneous Mixture Components for the Analysis of Spectrometry Data

    Authors: Dominik Kopczynski, Sven Rahmann

    Abstract: Coupling a multi-capillary column (MCC) with an ion mobility (IM) spectrometer (IMS) opened a multitude of new application areas for gas analysis, especially in a medical context, as volatile organic compounds (VOCs) in exhaled breath can hint at a person's state of health. To obtain a potential diagnosis from a raw MCC/IMS measurement, several computational steps are necessary, which so far have… ▽ More

    Submitted 21 May, 2014; originally announced May 2014.

  5. arXiv:1403.1706  [pdf, other

    cs.DS cs.DC q-bio.QM

    Massively parallel read mapping on GPUs with PEANUT

    Authors: Johannes Köster, Sven Rahmann

    Abstract: We present PEANUT (ParallEl AligNment UTility), a highly parallel GPU-based read mapper with several distinguishing features, including a novel q-gram index (called the q-group index) with small memory footprint built on-the-fly over the reads and the possibility to output both the best hits or all hits of a read. Designing the algorithm particularly for the GPU architecture, we were able to reach… ▽ More

    Submitted 7 March, 2014; originally announced March 2014.

  6. arXiv:1011.5778  [pdf, ps, other

    cs.FL q-bio.QM

    Probabilistic Arithmetic Automata and their Applications

    Authors: Tobias Marschall, Inke Herms, Hans-Michael Kaltenbach, Sven Rahmann

    Abstract: We present probabilistic arithmetic automata (PAAs), a general model to describe chains of operations whose operands depend on chance, along with two different algorithms to exactly calculate the distribution of the results obtained by such probabilistic calculations. PAAs provide a unifying framework to approach many problems arising in computational biology and elsewhere. Here, we present five d… ▽ More

    Submitted 26 November, 2010; originally announced November 2010.

    MSC Class: 62P10; 68W32; 68W40; 68Q87; 92B05; 92C40

  7. arXiv:1009.6114  [pdf, ps, other

    cs.DS cs.FL

    Exact Analysis of Pattern Matching Algorithms with Probabilistic Arithmetic Automata

    Authors: Tobias Marschall, Sven Rahmann

    Abstract: We propose a framework for the exact probabilistic analysis of window-based pattern matching algorithms, such as Boyer-Moore, Horspool, Backward DAWG Matching, Backward Oracle Matching, and more. In particular, we show how to efficiently obtain the distribution of such an algorithm's running time cost for any given pattern in a random text model, which can be quite general, from simple uniform mod… ▽ More

    Submitted 30 September, 2010; originally announced September 2010.

    MSC Class: 68W32; 68W40