Extracting Key-Substring-Group Features for Text Classification (Supplement)
Code
The C program for Key-Substring-Group feature extraction:
[.tar.gz]
[.zip]
It is based on
Prof. Dan Gusfield
's suffix tree code
strmat
.
Another (probably better) implementation can be found in
Jun Li
's
TextMatrix
open-source project.