A multi-dimensional indexing approach for timestamped event sequence matching

Sanghyun Park, Jung Im Won, Jee Hee Yoon, Sang Wook Kim

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)


This paper addresses the problem of timestamped event sequence matching, a new type of similar sequence matching that retrieves the occurrences of interesting patterns from timestamped sequence databases. The sequential-scan-based method, the trie-based method, and the method based on the iso-depth index are well-known approaches to this problem. In this paper, we point out their shortcomings, and propose a new method that effectively overcomes these shortcomings. The proposed method employs an R*-tree, a widely accepted multi-dimensional index structure that efficiently supports timestamped event sequence matching. To build the R*-tree, this method extracts time windows from every item in a timestamped event sequence and represents them as rectangles in n-dimensional space by considering the first and last occurring times of each event type. Here, n is the total number of disparate event types that may occur in a target application. To resolve the dimensionality curse in the case when n is large, we suggest an algorithm for reducing the dimensionality by grouping the event types. Our sequence matching method based on the R*-tree performs with two steps. First, it efficiently identifies a small number of candidates by searching the R*-tree. Second, it picks out true answers from the set of candidates. We prove its robustness formally, and also show its effectiveness via extensive experiments.

Original languageEnglish
Pages (from-to)4859-4876
Number of pages18
JournalInformation sciences
Issue number22
Publication statusPublished - 2007 Nov 15

Bibliographical note

Funding Information:
This work was partially supported by the Korea Research Foundation Grant funded from the Korean Government (KRF-2005-041-D00651) and by the research fund of Hanyang University (HY-2005-I).

All Science Journal Classification (ASJC) codes

  • Software
  • Control and Systems Engineering
  • Theoretical Computer Science
  • Computer Science Applications
  • Information Systems and Management
  • Artificial Intelligence


Dive into the research topics of 'A multi-dimensional indexing approach for timestamped event sequence matching'. Together they form a unique fingerprint.

Cite this