Additional Content
Main Content
Corpora
Webis-CLS-10
Cross-lingual sentiment dataset.
[2010 - today]
PAN-PC-09 [upcoming version PAN-PC-10]
Plagiarism corpus for the evaluation of plagiarism detection algorithms.
[2006 - today]
Webis-WVC-07 [upcoming version PAN-WVC-10]
Wikipedia vandalism corpus for the evaluation of vandalism detection algorithms.
[2007 - today]
Wikipedia Revision Dump
Wikipedia revison dump corpus for the study of collaborative writing on Wikipedia.
[2006, 2008]
Content signature
© Fakultät Medien 11.05.2010 / Kontakt / Impressum / Bemerkung zu dieser Seite



