Dictionary based compression type classification using a CNN architecture

Hyewon Song, Beom Kwon, Seongmin Lee, Sanghoon Lee

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

As digital devices which are capable of viewing contents easily such as mobile phones and tablet PCs have become widespread, the number of digital crimes using these digital contents also increases. Usually, the data which can be the evidence of crimes is compressed and the header of data is damaged to conceal the contents. Therefore, it is necessary to identify the characteristics of the entire bits of the compressed data to discriminate the compression type not using the header of the data. In this paper, we propose a method for distinguishing 16 dictionary-based compression types. We utilize 5-layered Convolutional Neural Network (CNN) for classification of compression type using Spatial Pyramid Pooling (SPP) layer. We evaluate our proposed method on the Wikileaks Dataset, which is a text file database. The average accuracy of 16 dictionary-based compression algorithms is 99%. We expect that our proposed method will be useful for providing evidence for Digital Forensics.

Original languageEnglish
Title of host publication2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages245-248
Number of pages4
ISBN (Electronic)9781728132488
DOIs
Publication statusPublished - 2019 Nov
Event2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019 - Lanzhou, China
Duration: 2019 Nov 182019 Nov 21

Publication series

Name2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019

Conference

Conference2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019
Country/TerritoryChina
CityLanzhou
Period19/11/1819/11/21

Bibliographical note

Funding Information:
ACKNOWLEDGMENT This work was supported by the research fund of Signal Intelligence Research Center supervised by Defense Acquisition Program Administration and Agency for Defense Development of Korea.

Publisher Copyright:
© 2019 IEEE.

All Science Journal Classification (ASJC) codes

  • Information Systems

Fingerprint

Dive into the research topics of 'Dictionary based compression type classification using a CNN architecture'. Together they form a unique fingerprint.

Cite this