Mondays at 11.00, Seminar room Bauhausstraße 11/013
The ever-increasing flood of digital information poses new challenges to data mining and machine learning practitioners. Data sets of interest routinely reach scales that call for distributed processing architectures. There is a great variety of problems to be solved in the areas of data mining, processing, and storage, and a vast landscape of software projects has arisen to address these problems. In this seminar, participants will get to know a selection of big data tools, and will gain hands-on experience in deploying, administering and using distributed systems.
In order to receive a grade, seminar participants should:
Give a seminar presentation (up to ~30 min)
Provide a demo implementation and examples
Actively participate in discussions on other participants' talks
Talk topics and task details will be provided in one of the early seminar sessions.
[2019-04-15] Kick-off meeting / crash course [slides]. Before the first meeting, please install everything mentioned under `Software` below on your laptop. You should bring your laptop to the seminar in order to participate in the tutorial sessions.
In order to participate in the seminar, please install the following software:
Oracle VirtualBox [download] Download and install the VirtualBox platform package for your operating system (Windows/Mac/Linux). [Note to Windows users (Win 8 and later): you need to disable the Hyper-V feature, otherwise VirtualBox will not work.]
Vagrant by HashiCorp [download] Download and install the Vagrant VM management software for your operating system (Windows/Mac/Linux).
Git [download] Download and install the Git source control software for your operating system (Windows/Mac/Linux).
FoxyProxy [firefox] [chrome] Install the FoxyProxy browser extension in your preferred web browser.
In preparation for the first seminar session, install all of the above, then open a terminal (Windows users: run the app called "Git Bash") and type or paste the following command, then press Enter: `vagrant box add --provider virtualbox bento/ubuntu-18.04`
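If the command above fails with "command not found", one of the tools is probably not installed or not on your PATH. The following is a minimal sketch of a check you could run in the same terminal first. The function name `check_tool` is made up for this example, and the script assumes that VirtualBox's command-line front end `VBoxManage` is on your PATH, which is not always the case on Windows even when VirtualBox is installed correctly.

```shell
#!/bin/sh
# Report whether a command-line tool can be found on PATH.
# Returns 0 if the tool is found, 1 otherwise.
# (check_tool is a hypothetical helper, not part of any of the tools.)
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found: $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# Command-line tools the seminar setup relies on.
# Assumption: VBoxManage is on PATH; on Windows it may not be,
# even if VirtualBox itself works.
missing=0
for tool in git vagrant VBoxManage; do
    check_tool "$tool" || missing=$((missing + 1))
done
echo "$missing tool(s) missing"
```

Once the box has been downloaded, `vagrant box list` should show an entry for `bento/ubuntu-18.04`.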
In addition to the software mentioned above, you should have a decent source code editor installed that you know how to use. If you don't have a preference of your own, we recommend Visual Studio Code (available for all platforms).