GPU-friendly parallel genome matching with tiled access and reduced state transition table

Yunho Oh, Doohwan Oh, Won W. Ro

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)


In this paper, we propose a new parallel genome matching algorithm for graphics processing units (GPUs). Our approach is based on the Aho-Corasick algorithm and was designed around the architectural features of existing GPUs with a hundred or more cores. We provide a task partitioning method suited to multithreaded execution, and we fully utilize the cache memory and shared memory structures available in GPUs. In particular, we propose a tiled access method for rapid data transfer from global memory to shared memory. We also provide new models for a cache-friendly state transition table to improve the performance of pattern matching operations on GPUs. The maximum throughput achieved in our experiments was 15.3 Gbps. Moreover, our proposed design outperformed an earlier approach by 15.4%.
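As a rough illustration of two ideas named in the abstract, a table-driven Aho-Corasick matcher and partitioning of the input across independent workers, the following is a minimal CPU-side sketch in Python. It is not the authors' implementation: the four-letter alphabet, the function names `build_table` and `match_chunked`, and the overlap-based chunking scheme are assumptions made for illustration, and the GPU-specific parts (tiled copies into shared memory, the reduced table layout) are not modeled.

```python
from collections import deque

ALPHABET = "ACGT"  # assumed genome alphabet (illustrative)
SYM = {c: i for i, c in enumerate(ALPHABET)}


def build_table(patterns):
    """Build a dense Aho-Corasick transition table: one row per state,
    one column per symbol, so matching is a single lookup per input symbol."""
    table = [[-1] * len(ALPHABET)]  # -1 marks a missing trie edge
    out = [set()]                   # indices of patterns recognized in each state
    for idx, pat in enumerate(patterns):
        s = 0
        for ch in pat:
            c = SYM[ch]
            if table[s][c] == -1:
                table.append([-1] * len(ALPHABET))
                out.append(set())
                table[s][c] = len(table) - 1
            s = table[s][c]
        out[s].add(idx)

    # Breadth-first pass: resolve failure links and fill in missing edges.
    fail = [0] * len(table)
    queue = deque()
    for c in range(len(ALPHABET)):
        if table[0][c] == -1:
            table[0][c] = 0
        else:
            queue.append(table[0][c])
    while queue:
        s = queue.popleft()
        for c in range(len(ALPHABET)):
            t = table[s][c]
            if t == -1:
                table[s][c] = table[fail[s]][c]
            else:
                fail[t] = table[fail[s]][c]
                out[t] |= out[fail[t]]
                queue.append(t)
    return table, out


def match_chunked(text, patterns, chunk):
    """Scan `text` in independent chunks (each chunk stands in for one thread).
    Every chunk reads max_pattern_length - 1 extra symbols past its end so that
    matches straddling a chunk boundary are still found exactly once."""
    table, out = build_table(patterns)
    overlap = max(len(p) for p in patterns) - 1
    hits = []
    for begin in range(0, len(text), chunk):
        end = begin + chunk
        state = 0  # each chunk starts from the root state
        for i in range(begin, min(end + overlap, len(text))):
            state = table[state][SYM[text[i]]]
            for idx in out[state]:
                start = i - len(patterns[idx]) + 1
                if start < end:  # later starts belong to the next chunk
                    hits.append((start, idx))
    return sorted(hits)


print(match_chunked("ACGTACGT", ["CGT", "GTAC", "T"], chunk=3))
# [(1, 0), (2, 1), (3, 2), (5, 0), (7, 2)]
```

Each result is a `(start position, pattern index)` pair. Because every chunk begins in the root state and only reports matches that start inside its own range, the chunks share no state and could be processed in parallel.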

Original language: English
Pages (from-to): 526-551
Number of pages: 26
Journal: International Journal of Parallel Programming
Issue number: 4
Publication status: Published - 2013 Aug

Bibliographical note

Funding Information:
This work was supported by the Basic Science Research Program through the National Research Foundation of Korea, which is funded by the Ministry of Education, Science and Technology [2009-0070364].

All Science Journal Classification (ASJC) codes

  • Software
  • Theoretical Computer Science
  • Information Systems

