Random effects logistic regression model for anomaly detection

Min Seok Mok, So Young Sohn, Yong Han Ju

Research output: Contribution to journalArticle

23 Citations (Scopus)

Abstract

As the influence of the internet continues to expand as a medium for communications and commerce, the threat from spammers, system attackers, and criminal enterprises has grown accordingly. This paper proposes a random effects logistic regression model to predict anomaly detection. Unlike the previous studies on anomaly detection, a random effects model was applied, which accommodates not only the risk factors of the exposures but also the uncertainty not explained by such factors. The specific factors of the risk category such as retained 'protocol type' and 'logged in' are included in the proposed model. The research is based on a sample of 49,427 random observations for 42 variables of the KDD-cup 1999 (Data Mining and Knowledge Discovery competition) data set that contains 'normal' and 'anomaly' connections. The proposed model has a classification accuracy of 98.94% for the training data set, while that for the validation data set is 98.68%.

Original languageEnglish
Pages (from-to)7162-7166
Number of pages5
JournalExpert Systems with Applications
Volume37
Issue number10
DOIs
Publication statusPublished - 2010

All Science Journal Classification (ASJC) codes

  • Engineering(all)
  • Computer Science Applications
  • Artificial Intelligence

Fingerprint Dive into the research topics of 'Random effects logistic regression model for anomaly detection'. Together they form a unique fingerprint.

  • Cite this