Note: Please follow the installation docs for the Kappa course first.
-
Clone this repository to your computer using git.
-
Open
lab-kappa.code-workspace
using Visual Studio Code. -
Choose "Yes, I trust the authors", so VSCode enables regular mode.
-
Click on the "Remote" icon on the bottom left of VSCode.
-
Wait until the container is set up. This can take a few minutes because the container needs to be pulled and built. You can check the progress by clicking "Starting Dev Container (show log)" in the notification on the bottom right of VSCode.
-
When the containers are setup, open
introduction.ipynb
and follow the instructions there. When asked about which kernel to use, always choose the Python kernelbase (Python 3.9.7) /opt/conda/bin/python
.
If you encounter uncommon issues with Spark or connectivity issues to Kafka, it might help to remove all containers from your system, restart vscode and choose "open folder in container" again.
For users on Linux and macOS, run the following command in order to remove ALL containers from your system.
docker stop $(docker ps -aq) && docker rm $(docker ps -aq)
Windows users can do this from the Docker GUI or run the following commands in PowerShell:
docker stop $(docker ps -aq)
docker rm $(docker ps -aq)
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright © teaching staff of the Big Data hands-on course at UGent Academie voor Ingenieurs (UGain) at the Faculty of Engineering and Architecture - Ghent University.