U.S. Department of Energy
Office of Scientific and Technical Information

Input-independent, Scalable and Fast String Matching on the Cray XMT

Conference

String searching is at the core of many security and network applications like search engines, intrusion detection systems, virus scanners and spam filters. The growing size of on-line content and the increasing wire speeds push the need for fast, and often real-time, string searching solutions. For these conditions, many software implementations (if not all) targeting conventional cache-based microprocessors do not perform well. They either exhibit overall low performance or exhibit highly variable performance depending on the types of inputs. For this reason, real-time state-of-the-art solutions rely on the use of either custom hardware or Field-Programmable Gate Arrays (FPGAs) at the expense of overall system flexibility and programmability. This paper presents a software-based implementation of the Aho-Corasick string searching algorithm on the Cray XMT multithreaded shared memory machine. Our solution relies on the particular features of the XMT architecture and on several algorithmic strategies: it is fast, scalable and its performance is virtually content-independent. On a 128-processor Cray XMT, it reaches a scanning speed of ≈ 28 Gbps with a performance variability below 10%. In the 10 Gbps performance range, variability is below 2.5%. By comparison, an Intel dual-socket, 8-core system running at 2.66 GHz achieves a peak performance which varies from 500 Mbps to 10 Gbps depending on the type of input and dictionary size.
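For readers unfamiliar with the algorithm named in the abstract, the following is a minimal sketch of textbook Aho-Corasick matching in Python. It is an illustration only and is not the implementation described in the paper: it is single-threaded, uses none of the XMT-specific features or algorithmic strategies the abstract refers to, and the function names `build_automaton` and `search` are hypothetical.

```python
from collections import deque


def build_automaton(patterns):
    """Build goto, failure and output tables for non-empty patterns."""
    goto = [{}]   # goto[state][char] -> next state (the keyword trie)
    fail = [0]    # failure link of each state
    out = [[]]    # patterns recognized on entering each state

    # Phase 1: insert every pattern into the trie.
    for pat in patterns:
        state = 0
        for ch in pat:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].append(pat)

    # Phase 2: breadth-first traversal to compute failure links.
    # Depth-1 states keep their failure link at the root (state 0).
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            # Inherit the matches of the longest proper suffix state.
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def search(text, automaton):
    """Return (start_index, pattern) for every occurrence in text."""
    goto, fail, out = automaton
    state = 0
    matches = []
    for i, ch in enumerate(text):
        # Follow failure links until a transition on ch exists.
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for pat in out[state]:
            matches.append((i - len(pat) + 1, pat))
    return matches
```

With the dictionary `["he", "she", "his", "hers"]` and the text `"ushers"`, `search` reports `she` at index 1 and both `he` and `hers` at index 2. All dictionary entries are matched in a single pass over the input, which is the property that makes the algorithm a common choice for the scanning workloads the abstract lists.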

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (US)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
973738
Report Number(s):
PNNL-SA-62783; 400470000
Country of Publication:
United States
Language:
English

Similar Records

Aho-Corasick String Matching on Shared and Distributed Memory Parallel Architectures
Journal Article · Wed Feb 29 23:00:00 EST 2012 · IEEE Transactions on Parallel and Distributed Systems · OSTI ID:1034574

Fast and Accurate Simulation of the Cray XMT Multithreaded Supercomputer
Journal Article · Sun Dec 30 23:00:00 EST 2012 · IEEE Transactions on Parallel and Distributed Systems, 23(12):2266-2279 · OSTI ID:1088646

Efficient Aho-Corasick String Matching on Emerging Multicore Architectures
Book · Wed Dec 11 23:00:00 EST 2013 · OSTI ID:1182352