This repository has been archived by the owner on Sep 18, 2024. It is now read-only.

Hands-on lab about the Kappa architecture and Spark streaming - assignment

IBCNServices/lab-kappa

Big Data hands-on - Kappa Architecture

Getting started

Note: Please follow the installation docs for the Kappa course first.

  1. Clone this repository to your computer using git.

  2. Open lab-kappa.code-workspace using Visual Studio Code.

  3. Choose "Yes, I trust the authors" so that VSCode leaves Restricted Mode and enables all features for this workspace.

    (Screenshot: workspace trust dialog)

  4. Click on the "Remote" icon on the bottom left of VSCode.

    (Screenshot: "Remote" button)

    1. Choose the option "Reopen in container" from the command palette.

      (Screenshot: "Reopen in container" option)

    2. The container will be automatically created based on the configuration included in the lab files.

  5. Wait until the container is set up. This can take a few minutes because the container needs to be pulled and built. You can check the progress by clicking "Starting Dev Container (show log)" in the notification on the bottom right of VSCode.

  6. When the containers are set up, open introduction.ipynb and follow the instructions there. When asked which kernel to use, always choose the Python kernel "base (Python 3.9.7)" at /opt/conda/bin/python.
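
Steps 1 and 2 can also be done from a terminal. The following is a minimal sketch, not part of the original lab instructions. It assumes that the repository is still reachable on GitHub as IBCNServices/lab-kappa, that lab-kappa.code-workspace sits in the repository root, and that the `code` command-line launcher of VSCode is on your PATH. The function name `open_lab` is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: clone the lab and open its workspace in VSCode.
# Assumptions: the repository URL below is still valid, the workspace file
# is in the repository root, and the `code` launcher is on the PATH.
open_lab() {
  repo_url="https://github.com/IBCNServices/lab-kappa.git"

  # Clone only if the target directory does not exist yet.
  if [ ! -d lab-kappa ]; then
    git clone "$repo_url" lab-kappa || return 1
  fi

  # Open the workspace file that ships with the lab.
  code lab-kappa/lab-kappa.code-workspace
}
```

After sourcing the snippet, run `open_lab` from the directory in which the lab should be stored, then continue with step 3.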

Appendix

Reset environment and remove databases

If you run into unexpected issues with Spark or connectivity problems with Kafka, it might help to remove all containers from your system, restart VSCode, and choose "open folder in container" again.

On Linux and macOS, run the following command to stop and remove ALL containers on your system, including any that are unrelated to this lab.

docker stop $(docker ps -aq) && docker rm $(docker ps -aq)
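
Note that `docker stop` exits with an error when it receives no container IDs, so the command above fails on a system without any containers. A guarded variant could look like the sketch below. It is not part of the original lab, and the function name `remove_all_containers` is hypothetical.

```shell
# Hypothetical helper: stop and remove ALL containers, skipping the
# docker stop/rm calls when there is nothing to remove.
remove_all_containers() {
  # List the IDs of all containers, running or stopped.
  ids=$(docker ps -aq) || return 1

  if [ -z "$ids" ]; then
    echo "No containers to remove."
    return 0
  fi

  # $ids is intentionally unquoted so each ID becomes its own argument.
  docker stop $ids && docker rm $ids
}
```

After sourcing the snippet, call `remove_all_containers` in the same shell session.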

Windows users can do this from the Docker GUI or run the following commands in PowerShell:

docker stop $(docker ps -aq)
docker rm $(docker ps -aq)

Copyright

This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © teaching staff of the Big Data hands-on course at UGent Academie voor Ingenieurs (UGain) at the Faculty of Engineering and Architecture - Ghent University.