Index of /~gvcormac/corpus

Name              Last modified      Size  Description

Parent Directory                     -
README.html       2006-06-08 09:41   368
corpus.tgz        2006-06-08 08:20   11K
ham/              2006-06-08 09:26   -
spam/             2006-06-08 09:26   -

TREC 2006 Sample Chinese Corpus

You can browse this corpus of 18 sample messages, or download the .tgz file. It is in TREC toolkit format.
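
As a quick sanity check after downloading, the messages in the archive can be tallied by label. The following is a minimal sketch, not part of the corpus or the TREC toolkit: it assumes corpus.tgz has been saved locally and that messages sit under directories named ham and spam, as in the listing above. The function name count_by_label is hypothetical.

```python
import tarfile
from collections import Counter
from pathlib import PurePosixPath


def count_by_label(tgz_path, labels=("ham", "spam")):
    """Count regular files in a .tgz that live under a directory named after a label.

    Assumption: the archive mirrors the ham/ and spam/ directories shown in
    the listing; the actual layout inside corpus.tgz may differ.
    """
    counts = Counter()
    with tarfile.open(tgz_path, "r:gz") as archive:
        for member in archive.getmembers():
            if not member.isfile():
                continue  # skip directories, links, and other non-file entries
            # Look only at the directory components, not the file name itself.
            directories = PurePosixPath(member.name).parts[:-1]
            for label in labels:
                if label in directories:
                    counts[label] += 1
                    break
    return counts
```

Calling count_by_label("corpus.tgz") returns a Counter keyed by "ham" and "spam". If the assumed layout holds, the two counts should sum to the 18 sample messages mentioned above.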

Many thanks to

Dr. Quang-Anh Tran
CERNET Computer Emergency Response Team (CCERT)
FIT 4-204, Tsinghua University, Beijing, China, 100084

for supplying the data and for help with corpus creation.