Batch cost analysis #959

avrohomgottlieb · 2024-11-15T20:04:37Z

Context

In issue #944, we succeeded in generating computed files on Batch and ran jobs for each download_config.

Issues #956 and #957 will address certain platform-related discrepancies that arose in the output files and during the job runs themselves.

In this issue we should look into different resource allocation strategies to determine which is the most cost-effective. As a result of issues #956 and #957, the bottlenecks should become clearer and we'll be able to identify the best cost strategies.

We should collect and report the results of these strategies in terms of metrics such as job duration, file size, memory size, and network data.
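
A minimal sketch of how per-job timing could be summarized for such a comparison. It assumes job records shaped like the AWS Batch DescribeJobs response, where `createdAt`, `startedAt`, and `stoppedAt` are epoch timestamps in milliseconds. The `summarize_jobs` helper and the sample records are hypothetical and are not taken from the project.

```python
from datetime import datetime, timedelta, timezone


def summarize_jobs(jobs):
    """Summarize timing for job records shaped like Batch DescribeJobs output.

    Only jobs that have both started and stopped are counted.
    """
    def to_dt(ms):
        # Batch reports timestamps as milliseconds since the Unix epoch.
        return datetime.fromtimestamp(ms / 1000, tz=timezone.utc)

    finished = [j for j in jobs if "startedAt" in j and "stoppedAt" in j]
    if not finished:
        return None

    first_received = min(to_dt(j["createdAt"]) for j in finished)
    last_completed = max(to_dt(j["stoppedAt"]) for j in finished)
    run_seconds = [(j["stoppedAt"] - j["startedAt"]) / 1000 for j in finished]
    wait_seconds = [(j["startedAt"] - j["createdAt"]) / 1000 for j in finished]

    return {
        "job_count": len(finished),
        "first_received": first_received,
        "last_completed": last_completed,
        "total_duration": last_completed - first_received,
        "mean_run_seconds": sum(run_seconds) / len(run_seconds),
        "max_queue_wait_seconds": max(wait_seconds),
    }


# Made-up sample records for illustration only (not real measurements).
sample_jobs = [
    {"jobName": "example-a", "createdAt": 0, "startedAt": 60_000, "stoppedAt": 660_000},
    {"jobName": "example-b", "createdAt": 30_000, "startedAt": 90_000, "stoppedAt": 1_290_000},
]

summary = summarize_jobs(sample_jobs)
print(summary["job_count"], summary["total_duration"])
```

File size, memory, and network figures would have to come from other sources (for example job output or monitoring data), which this sketch does not cover.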

avrohomgottlieb · 2024-12-03T16:41:28Z

Now that we've spent the last few weeks optimizing the Batch implementation and working out the kinks, this issue is being repurposed to measure only the baseline time and cost of reloading all projects on staging.

We're going to keep the current queue and compute_environment configurations, with 16 vCPUs and 200 GB of ephemeral storage.

avrohomgottlieb · 2024-12-09T16:38:06Z

Dev Stack Results

Below are the results from running everything on my dev stack.

The dev stack used the following resources:

compute environment: 16 vCPU
job definition: 1 vCPU, 4 GB Memory, 200 GB of ephemeral storage per job
job queue: 1 queue

Location	Process	Total Duration	First Job Received	Last Job Completed
API	Metadata Loading	00:27:00	Sunday 14:37	Sunday 15:04
Batch	Computed File Generation	02:58:56	Sunday 15:04:34	Sunday 18:03:30
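
As a quick cross-check of the Total Duration column, the sketch below recomputes each duration from the first and last timestamps. It assumes both timestamps fall on the same day and that the API row's times, which are given without seconds, mean 14:37:00 and 15:04:00. The `elapsed` helper is illustrative only.

```python
from datetime import datetime


def elapsed(start, end, fmt="%H:%M:%S"):
    """Return end - start as an HH:MM:SS string for two same-day clock times."""
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    total = int(delta.total_seconds())
    hours, rem = divmod(total, 3600)
    minutes, seconds = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


# Assumption: seconds are :00 for the API row, since the table omits them.
metadata_loading = elapsed("14:37:00", "15:04:00")
file_generation = elapsed("15:04:34", "18:03:30")

print(metadata_loading)  # 00:27:00
print(file_generation)   # 02:58:56
```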

avrohomgottlieb · 2024-12-09T17:23:50Z

Staging Results

The following are the results of running the entire portal on Batch with different resource allocations:

Compute Environment	Job Definition	Total Duration	First Job Received	Last Job Completed
16 vCPU	1.0 vCPU	-	-	-
16 vCPU	0.5 vCPU	-	-	-
32 vCPU	1.0 vCPU	-	-	-
32 vCPU	0.5 vCPU	-	-	-
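
For reading the rows above, here is a rough sketch of the concurrency ceiling each configuration implies. It assumes the Compute Environment column is the environment's maximum vCPU count and that vCPU is the only binding constraint. Memory, ephemeral storage, and account quotas are ignored, so real concurrency may be lower. The `max_concurrent_jobs` helper is hypothetical.

```python
def max_concurrent_jobs(env_vcpus, job_vcpus):
    """Upper bound on simultaneous jobs if vCPU is the only limit."""
    return int(env_vcpus // job_vcpus)


# (compute environment vCPUs, job definition vCPUs) for each table row.
configs = [(16, 1.0), (16, 0.5), (32, 1.0), (32, 0.5)]
ceilings = {cfg: max_concurrent_jobs(*cfg) for cfg in configs}

for (env_vcpus, job_vcpus), ceiling in ceilings.items():
    print(f"{env_vcpus} vCPU env, {job_vcpus} vCPU jobs: up to {ceiling} concurrent jobs")
```

Under these assumptions the 16 vCPU / 0.5 vCPU and 32 vCPU / 1.0 vCPU rows share the same ceiling of 32 jobs, so any difference in their measured durations would point to per-job CPU being the bottleneck rather than parallelism.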

avrohomgottlieb self-assigned this Nov 19, 2024

davidsmejia mentioned this issue Jan 23, 2025

propagate tags from aws_batch_job_definition #1057

Merged

1 task
