Index of /~gvcormac/corpus

[ICO]NameLast modifiedSizeDescription

[PARENTDIR]Parent Directory  -  
[   ]corpus.tgz2006-06-08 08:20 11K 
[DIR]ham/2006-06-08 09:26 -  
[DIR]spam/2006-06-08 09:26 -  
[TXT]README.html2006-06-08 09:41 368  

TREC 2006 Sample Chinese Corpus

You can browse this corpus of 18 sample messages, or download the .tgz file. It is in TREC toolkit format.

Many thanks to

Dr. Quang-Anh Tran
CERNET Computer Emergency Response Team (CCERT)
FIT 4-204, Tsinghua University, Beijing , China, 100084 

for supplying the data and for help with corpus creation.