-
Approximate String Matching using a Bidirectional Index
Abstract: We study strategies of approximate pattern matching that exploit bidirectional text indexes, extending and generalizing ideas of Lam et al. We introduce a formalism, called search schemes, to specify search strategies of this type, then develop a probabilistic measure for the efficiency of a search scheme, prove several combinatorial results on efficient search schemes, and finally, provide experi…
Submitted 6 September, 2015; v1 submitted 5 October, 2013; originally announced October 2013.
-
Using cascading Bloom filters to improve the memory usage for de Bruijn graphs
Abstract: De Bruijn graphs are widely used in bioinformatics for processing next-generation sequencing (NGS) data. Because NGS datasets are very large, it is essential to represent de Bruijn graphs compactly, and several approaches to this problem have been proposed recently. In this work, we show how to reduce the memory required by the algorithm of [3] that represents de Bruijn graphs using Bloom filters.…
Submitted 21 May, 2013; v1 submitted 28 February, 2013; originally announced February 2013.
Comments: 12 pages, submitted
ACM Class: E.2; J.3