Text datasets in MATLAB format


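(MATLAB 7 or higher is required to open these files.)
We provide the term-frequency (tf) vector of each document. The following code generates the tf-idf vectors, where each weight is [1 + log(tf)] * log(N/df), N is the number of documents, and df is the number of documents that contain the term.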
fea = tfidf(fea);
%Now each row of fea is the normalized tf-idf vector (the length of each vector is 1).
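
The tfidf function itself is not listed on this page. As a rough illustration of the weighting formula above, the following is a minimal sketch and not necessarily the authors' implementation. It assumes natural logarithms, assumes that "length" means the Euclidean (L2) norm, and uses a small made-up tf matrix; the variable names df, idf, w, and rowNorm are chosen here for illustration only.

```matlab
% Toy term-frequency matrix: 3 documents (rows) x 4 terms (columns).
% Hypothetical data, used only to illustrate the formula.
fea = sparse([2 0 1 0;
              0 3 0 0;
              1 1 0 4]);

[nDoc, nTerm] = size(fea);

% Document frequency: number of documents that contain each term.
df = full(sum(fea > 0, 1))';

% Inverse document frequency log(N/df); terms that never occur keep weight 0.
idf = zeros(nTerm, 1);
idf(df > 0) = log(nDoc ./ df(df > 0));

% Apply [1 + log(tf)] * log(N/df) to the nonzero entries only,
% so the matrix stays sparse.
[i, j, v] = find(fea);
w = sparse(i, j, (1 + log(v)) .* idf(j), nDoc, nTerm);

% Scale each row to unit Euclidean length (assumed normalization).
rowNorm = sqrt(full(sum(w.^2, 2)));
rowNorm(rowNorm == 0) = 1;   % leave all-zero rows unchanged
w = spdiags(1 ./ rowNorm, 0, nDoc, nDoc) * w;
```

Note that under this formula a term that appears in every document gets weight log(N/N) = 0, so a document made up only of such terms would end up as an all-zero row; the sketch leaves those rows at zero rather than dividing by zero.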

If you use the processed data sets on this page, we would appreciate it if you cited the following works:
