Lecturer: |
Prof. Dr. Benno Stein |
Advisor: |
Michael Völske |
Workload: |
3 ECTS |
Kick-off meeting: |
April 6th, 17:00, room SR015, B11 |
Regular sessions: |
Thursdays, 17:00, room SR014, B11 |
The ever-increasing flood of digital information poses new challenges to data mining and machine learning practitioners. Data sets of interest routinely reach scales that call for distributed processing architectures. In this seminar, participants will acquaint themselves with a selection of data processing tools based on the Apache Hadoop platform. In a practical part, seminar participants will work on relevant data mining problems. The Webis research group operates a large, modern high-performance compute cluster (about 1600 CPU cores, 2.5 Petabytes of disk space), which will be put to use in the course of this seminar. Students will receive training in the fundamentals of hardware and software architectures of big data cluster technologies, and learn the skills necessary to apply them. Thanks to the size of the cluster and the Webis group's expertise with big data technologies, this seminar shall provide a level of training that is currently exceptional in an academic context.
The deadline for seminar paper submissions is August 18th, 2016, 12:00pm CEST. Submissions should consist of a single ZIP file with the following contents:
All submissions must be handed in via email to michael.voelske[at]uni-weimar.de. The file name of the attached zip file should include the names and matriculation numbers of all group members.
Leskovec, Rajaraman, Ullman. Mining of Massive Datasets. Cambridge University Press, 2014. http://infolab.stanford.edu/~ullman/mmds/book.pdf
Manning, Raghavan, Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008. http://nlp.stanford.edu/IR-book/
Wechsel zwischen Farb- und Schwarz-Weiß-Ansicht
Kontrastansicht aktiv
Kontrastansicht nicht aktiv
Wechsel der Hintergrundfarbe von Weiß zu Schwarz
Darkmode aktiv
Darkmode nicht aktiv
Fokussierte Elemente werden schwarz hinterlegt und so visuell hervorgehoben.
Feedback aktiv
Feedback nicht aktiv
Beendet Animationen auf der Website
Animationen aktiv
Animationen nicht aktiv