Languages and Services
    Web Technology & Information Systems / Prof. Dr. Benno Stein

    Corpora

    Webis-CLS-10
    Cross-lingual sentiment dataset.
    [2010 - today]

    PAN-PC-09 [upcoming version PAN-PC-10]
    Plagiarism corpus for the evaluation of plagiarism detection algorithms.
    [2006 - today]

    Webis-WVC-07 [upcoming version PAN-WVC-10]
    Wikipedia vandalism corpus for the evaluation of vandalism detection algorithms.
    [2007 - today]

    Wikipedia Revision Dump
    Wikipedia revision dump corpus for the study of collaborative writing on Wikipedia.
    [2006, 2008]

     


    © Fakultät Medien, 11.05.2010