A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Sceadan: Using Concatenated N-Gram Vectors for Improved File and Data Type Classification
2013
IEEE Transactions on Information Forensics and Security
Over 20 studies have been published in the past decade involving file and data type classification for digital forensics and information security applications. Methods using n-grams as inputs have proven the most successful across a wide variety of types; however, there are mixed results regarding the utility of unigrams and bigrams as inputs independently. In this study, we use support vector machines (SVMs) consisting of unigrams and bigrams, as well as complexity and other byte
doi:10.1109/tifs.2013.2274728
fatcat:u73jtlq3bzgjvmo6ffbsnuvg54