OMPPM: Online multiple palindrome pattern matching

Hwee Kim, Yo Sub Han

Research output: Contribution to journalArticle

Abstract

Motivation: A palindrome is a string that reads the same forward and backward. Finding palindromic substructures is important in DNA, RNA or protein sequence analysis. We say that two strings of the same length are pal-equivalent if, for each possible centre, they have the same length of the maximal palindrome. Given a text T of length n and a pattern P of length m, we study the palindrome pattern matching problem that finds all indices i such that P and T[i-m+1:i] are pal-equivalent. Results: We first solve the online palindrome pattern matching problem in O(m2) preprocessing time and O(mn) query time using O(m2) space. We then extend the problem for multiple patterns and solve the online multiple palindrome pattern matching problem in O(mkM) preprocessing time and O(mkn+c) query time using O(mkM) space, where M is the sum of all pattern lengths, mk is the longest pattern length and c is the number of pattern occurrences.

Original languageEnglish
Pages (from-to)1151-1157
Number of pages7
JournalBioinformatics
Volume32
Issue number8
DOIs
Publication statusPublished - 2016 Apr 15

    Fingerprint

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Cite this