
I modified a part of the code to enable parallel inference with multiple num_batch #1113

Open · wants to merge 4 commits into base: main

Conversation

zhugelaozei

Hi! Since I found that SAHI cannot perform parallel inference when using YOLOv11 for sliced inference, I modified part of the code and adapted it to work with the relevant parts of Ultralytics' code. Unfortunately, I have only adapted the Ultralytics portion for now. I hope this is helpful to you.
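
For reference, here is a minimal sketch of how batched sliced inference with this patch might be invoked. The exact name and placement of the `num_batch` argument are assumptions based on the PR title; the weights file and image path are placeholders.

```python
# Hedged usage sketch for this PR: `num_batch` is the parameter this PR
# proposes (exact signature assumed from the title); everything else is
# the existing SAHI API.
from sahi import AutoDetectionModel
from sahi.predict import get_sliced_prediction

# Only the Ultralytics backend is adapted by this PR.
detection_model = AutoDetectionModel.from_pretrained(
    model_type="ultralytics",
    model_path="yolo11n.pt",  # placeholder YOLOv11 weights
    confidence_threshold=0.3,
    device="cuda:0",
)

result = get_sliced_prediction(
    "large_image.jpg",  # placeholder input image
    detection_model,
    slice_height=640,
    slice_width=640,
    overlap_height_ratio=0.2,
    overlap_width_ratio=0.2,
    num_batch=8,  # assumed: number of slices run per forward pass
)
```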

@fcakyon
Collaborator

fcakyon commented Jan 4, 2025

Great @zhugelaozei! Can you please fix the formatting by:

  1. Installing the development dependencies:
     `pip install -e ."[dev]"`

  2. Running the code formatter:
     `python -m scripts.run_code_style format`

@tonyreina

Does the num_batch here refer to running bounding box detections on multiple slices of the same image in a batch? Or is it running multiple images at one time in a batch? Could you provide an example of how to use it?

@eVen-gits
Contributor

Hello,
I believe this is a very important feature. What is the current status on this?

@eVen-gits
Contributor

> Does the num_batch here refer to running bounding box detections on multiple slices of the same image in a batch? Or is it running multiple images at one time in a batch? Could you provide an example of how to use it?

@tonyreina hey. I believe it refers to one image. This is also the usual way to use SAHI: a large image, sliced into smaller ones.

On the topic: I tried the modifications today and they work. Here are my observations:

  • The program appears to crash unless perform_standard_pred is set to False (see the sketch below).
  • Even though sliced prediction seems to work, the performance gains are lower than expected. My prediction time for a 12768x9564 (122.1 MP) image, using 640x640 slices, goes from ~13 s to ~8 s (I don't have an accurate metric).
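
To make the first point concrete, here is a sketch of the workaround, reusing the placeholder `detection_model` and image from the sketch in the PR description above. `perform_standard_pred` is an existing `get_sliced_prediction` argument controlling the extra full-image pass; `num_batch` is the parameter proposed by this PR.

```python
from sahi.predict import get_sliced_prediction

# With this patch, the full-image (standard) prediction pass reportedly has
# to be disabled, otherwise the program crashes.
result = get_sliced_prediction(
    "large_image.jpg",  # placeholder input image
    detection_model,    # defined as in the earlier sketch
    slice_height=640,
    slice_width=640,
    perform_standard_pred=False,  # required with this patch, per the observation above
    num_batch=8,                  # parameter proposed by this PR
)
```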

With regard to the latter, I suspect it's similar to what I observed when running multiple instances of inference in separate processes.

One likely reason is the data-loading limitation, but beyond that, I suspect it has to do with some low-level locking of GPU operations. I'm really not an expert in hardware utilization, but perhaps someone with more experience could shed some light on this topic.

Either way, it's a welcome addition and I hope this change is seriously considered.

@dceluis

dceluis commented Feb 13, 2025

Hello. I did quite a bit of work on this a while ago. While I did not submit a PR, I thought I would post the link here for reference, in the hope that some of the implementation helps get this PR merged.

https://github.com/dceluis/sahi_batched
