talk_id | talk_slug | talk_type | talk_tags | session_slug | talk_title | talk_title_short | talk_materials_url | speakers | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
22199 |
scaling-automating-r-workflows-kubernetes |
regular |
|
rapid-response |
Scaling and automating R workflows with Kubernetes and Airflow |
Scaling and automating R workflows with Kubernetes and Airflow |
|
During the pandemic, epidemiologists have been forced to adapt to the unprecedented scale of the data and high cadence of reporting.
At the UK Health Security Agency, we have created a platform for teams to easily deploy R and/or Python tasks onto our High-Performance Computing resources, scheduling their execution, and allowing previously unthinkable workloads to be executed with ease. Thanks to Kubernetes, git, Docker, and Airflow, our epidemiologists can stop worrying about their laptop's memory and bandwidth, and focus on answering the crucial questions of the pandemic. We'd like to tell you how we did it.