CMU-CS-09-159 Computer Science Department School of Computer Science, Carnegie Mellon University
Fast Cache for Your Text: Accelerating Exact Iulian Moraru, David G. Andersen September 2009
This paper presents an algorithm for exact pattern matching based on a new type of Bloom filter that we call a feed-forward Bloom filter. Besides filtering the input corpus, a feed-forward Bloom filter is also able to reduce the set of patterns needed for the exact matching phase. We show that this technique, along with a CPU architecture aware design of the Bloom filter, can provide speedups between 2x and 30x, and memory consumption reductions as large as 50x when compared with grep, while the filtering speed can be as much as 5x higher than that of a normal Bloom filters. 26 pages
| |
Return to:
SCS Technical Report Collection This page maintained by [email protected] |