Multi Input Support #36

CaptainTrojan · 2024-06-01T23:04:57Z

Motivation

The goal is to support multi-input models and loaders in luxonis-train.

Examples

Multi-input loader

The loader simulates an input consisting of stereo images (left and right), disparity maps, and point clouds. It also identifies "left" as the primary image type, which can be used in visualizers.

class CustomMultiInputLoader(BaseLoaderTorch):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)

    @property
    def input_shape(self) -> dict[str, Size]:
        return {
            "left": torch.Size([3, 224, 224]),
            "right": torch.Size([3, 224, 224]),
            "disparity": torch.Size([1, 224, 224]),
            "pointcloud": torch.Size([1000, 3]),
        }

    @property
    def image_source(self) -> str:
        return "left"

    def __getitem__(self, idx):
        # Fake data
        left = torch.rand(3, 224, 224, dtype=torch.float32)
        right = torch.rand(3, 224, 224, dtype=torch.float32)
        disparity = torch.rand(1, 224, 224, dtype=torch.float32)
        pointcloud = torch.rand(1000, 3, dtype=torch.float32)
        inputs = {
            "left": left,
            "right": right,
            "disparity": disparity,
            "pointcloud": pointcloud,
        }

        # Fake labels
        segmap = torch.zeros(1, 224, 224, dtype=torch.float32)
        labels = {
            "segmentation": (segmap, LabelType.SEGMENTATION)
        }

        return inputs, labels

    def __len__(self):
        return 10

    def get_classes(self) -> dict[LabelType, list[str]]:
        return {LabelType.SEGMENTATION: ["square"]}

Multi-input model

This model includes nodes such as FullBackbone, RGBDBackbone (processing left, right, and disparity inputs), and PointcloudBackbone (processing pointcloud inputs). It also features FusionNeck and FusionNeck2 for combining outputs from these backbones, and CustomSegHead1 and CustomSegHead2 which also make use of the different fusion necks and inputs. The model showcases our capability to handle intricate data fusion, multi-modal processing, and serving loader data to any node in the graph.

model:
  name: example_multi_input
  nodes:
    - name: FullBackbone
      alias: full_backbone  # By default loads all loader sub-elements since no inputs nor input_sources are provided

    - name: RGBDBackbone
      alias: rgbd_backbone
      input_sources:
        - left
        - right
        - disparity

    - name: PointcloudBackbone
      alias: pointcloud_backbone
      input_sources:
        - pointcloud

    - name: FusionNeck
      alias: fusion_neck
      inputs:
        - rgbd_backbone
        - pointcloud_backbone
      input_sources:
        - disparity

    - name: FusionNeck2
      alias: fusion_neck_2
      inputs:
        - rgbd_backbone
        - pointcloud_backbone
        - full_backbone
    
    - name: CustomSegHead1
      alias: head_1
      inputs:
        - fusion_neck

    - name: CustomSegHead2
      alias: head_2
      inputs:
        - fusion_neck
        - fusion_neck_2
      input_sources:
        - disparity

Test changes

Updated collate_fn test due to how inputs were changed from Tensors to dict[str, Tensor].

Breaking changes

Changed the type of original_in_shape of LuxonisModel from Size to dict[str, Size] (multiple inputs), so any custom models making use of this will stop working. To resolve this, users can make use of the new structure like this.

…th export

…ead of bare tensors

…put_sources, which tells the LuxonisModel which loader sub-elements it wants to load, and LoaderConfig has an images_name parameter which identifies the image-like input among the sub-elements for compatibility with visualizers etc.

…aderTorchOutput.

…_dim from shapes

…ng use of images_name.

github-actions · 2024-06-01T23:17:28Z

☂️ Python Coverage

current status: ✅

Overall Coverage

Lines	Covered	Coverage	Threshold	Status
4885	3758	77%	0%	🟢

New Files

No new covered files...

Modified Files

File	Coverage	Status
luxonis_train/attached_modules/base_attached_module.py	84%	🟢
luxonis_train/attached_modules/losses/adaptive_detection_loss.py	98%	🟢
luxonis_train/attached_modules/metrics/mean_average_precision.py	100%	🟢
luxonis_train/attached_modules/metrics/mean_average_precision_keypoints.py	94%	🟢
luxonis_train/attached_modules/metrics/object_keypoint_similarity.py	95%	🟢
luxonis_train/attached_modules/visualizers/utils.py	80%	🟢
luxonis_train/core/core.py	80%	🟢
luxonis_train/models/luxonis_model.py	90%	🟢
luxonis_train/models/luxonis_output.py	72%	🟢
luxonis_train/nodes/base_node.py	76%	🟢
luxonis_train/nodes/bisenet_head.py	50%	🟢
luxonis_train/nodes/classification_head.py	100%	🟢
luxonis_train/nodes/efficient_bbox_head.py	100%	🟢
luxonis_train/nodes/implicit_keypoint_bbox_head.py	92%	🟢
luxonis_train/nodes/segmentation_head.py	100%	🟢
luxonis_train/utils/boxutils.py	83%	🟢
luxonis_train/utils/config.py	95%	🟢
luxonis_train/utils/general.py	79%	🟢
luxonis_train/utils/loaders/base_loader.py	76%	🟢
luxonis_train/utils/loaders/luxonis_loader_torch.py	100%	🟢
TOTAL	87%	🟢

updated for commit: 9987d0d by action🐍

CaptainTrojan · 2024-06-03T08:02:22Z

I see that the MacOS tests fail for all open PRs, so opening this one as well.

tests/integration/test_multi_input.py

kozlov721 · 2024-06-04T19:03:38Z

luxonis_train/utils/loaders/base_loader.py

+        self._images_name = images_name
+
+    @property
+    def images_name(self) -> str:


I would rename this to maybe source_name. It doesn't have to necessarily be images

Also raising an error in case it's None.

I would rename this to maybe source_name. It doesn't have to necessarily be images

Or do we always expect this to be an image?

The property images_name I created due to compatibility with image loggers/visualizers, which expect some of the loader inputs to contain image-like objects which it can process in OpenCV, log as images, etc. Right now, the config.yaml owner is responsible for setting this property for their custom multi-input logger so that the rest of the code knows which inputs are images.

Also raising an error in case it's None.

I moved the declaration of images_name to the config, which has 'features' as the value by default, so unless the user implements their custom loader overriding this property as None, it should never be None. Is that how you produced this scenario?

I see, that makes sense. Though I'd maybe rename it to image_source or something like that.

…to code structure requirements. Added missing tests for evaluation, export, and inference.

kozlov721 · 2024-06-10T09:03:57Z

I see that the MacOS tests fail for all open PRs, so opening this one as well.

Yeah, you can ignore the MacOS tests. The MacOS machines are running out of memory, I'm planning to fix it eventually, but it's low priority.

github-actions · 2024-06-12T23:02:44Z

Test Results

4 files 4 suites 1h 0m 47s ⏱️
68 tests 43 ✅ 25 💤 0 ❌
272 runs 172 ✅ 100 💤 0 ❌

Results for commit 9987d0d.

♻️ This comment has been updated with latest results.

Co-authored-by: GitHub Actions <[email protected]> Co-authored-by: Martin Kozlovsky <[email protected]>

CaptainTrojan added 8 commits June 2, 2024 00:27

Multi-input test case - building a complex multi-input POSET model wi…

4041c99

…th export

Added a test for new collate_fn, which collates dicts of tensors inst…

6809f98

…ead of bare tensors

Added a config for a multi-input model

f400246

Updated input_shape type and the return type of loader, the LuxonisLo…

769aaab

…aderTorchOutput.

Added images_name property to BaseNode class, removed redundant batch…

9cf7ebb

…_dim from shapes

Implemented multi-input support in node building and model export

4284fba

Compatibility changes due to the new way shapes work in loaders, maki…

0e1ceff

…ng use of images_name.

[Automated] Updated coverage badge

951b981

CaptainTrojan requested review from kozlov721, klemen1999, tersekmatija and conorsim June 3, 2024 08:02

CaptainTrojan marked this pull request as ready for review June 3, 2024 08:03

kozlov721 reviewed Jun 4, 2024

View reviewed changes

tests/integration/test_multi_input.py Outdated Show resolved Hide resolved

kozlov721 reviewed Jun 4, 2024

View reviewed changes

kozlov721 changed the title ~~Feature/multi input~~ Multi Input Support Jun 6, 2024

kozlov721 added the enhancement New feature or request label Jun 6, 2024

kozlov721 assigned CaptainTrojan Jun 6, 2024

Moved 'images_name' setting from loader implementation to config due …

1cdcb9f

…to code structure requirements. Added missing tests for evaluation, export, and inference.

Merge branch 'dev' into feature/multi-input

6c69875

kozlov721 added 5 commits June 13, 2024 01:20

removed macos tests

6b07cd8

removed images_name from BaseNode

63c6ad0

renamed get_shape_pocket to to_shape_pocket

a4906e7

simplified input source handling

25d1758

renamed images_name to image_source

9987d0d

kozlov721 merged commit 669a023 into dev Jun 13, 2024
8 checks passed

kozlov721 deleted the feature/multi-input branch June 13, 2024 02:55

kozlov721 mentioned this pull request Oct 9, 2024

LuxonisTrain - v0.1.0 #102

Merged

kozlov721 added a commit that referenced this pull request Oct 9, 2024

Multi Input Support (#36)

a4fd904

Co-authored-by: GitHub Actions <[email protected]> Co-authored-by: Martin Kozlovsky <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi Input Support #36

Multi Input Support #36

CaptainTrojan commented Jun 1, 2024 •

edited by kozlov721

Loading

github-actions bot commented Jun 1, 2024 •

edited

Loading

CaptainTrojan commented Jun 3, 2024

kozlov721 Jun 4, 2024 •

edited

Loading

kozlov721 Jun 4, 2024

kozlov721 Jun 4, 2024

CaptainTrojan Jun 9, 2024

CaptainTrojan Jun 9, 2024 •

edited

Loading

kozlov721 Jun 10, 2024

kozlov721 commented Jun 10, 2024

github-actions bot commented Jun 12, 2024 •

edited

Loading

Multi Input Support #36

Multi Input Support #36

Conversation

CaptainTrojan commented Jun 1, 2024 • edited by kozlov721 Loading

Motivation

Examples

Multi-input loader

Multi-input model

Test changes

Breaking changes

github-actions bot commented Jun 1, 2024 • edited Loading

☂️ Python Coverage

Overall Coverage

New Files

Modified Files

CaptainTrojan commented Jun 3, 2024

kozlov721 Jun 4, 2024 • edited Loading

Choose a reason for hiding this comment

kozlov721 Jun 4, 2024

Choose a reason for hiding this comment

kozlov721 Jun 4, 2024

Choose a reason for hiding this comment

CaptainTrojan Jun 9, 2024

Choose a reason for hiding this comment

CaptainTrojan Jun 9, 2024 • edited Loading

Choose a reason for hiding this comment

kozlov721 Jun 10, 2024

Choose a reason for hiding this comment

kozlov721 commented Jun 10, 2024

github-actions bot commented Jun 12, 2024 • edited Loading

Test Results

CaptainTrojan commented Jun 1, 2024 •

edited by kozlov721

Loading

github-actions bot commented Jun 1, 2024 •

edited

Loading

kozlov721 Jun 4, 2024 •

edited

Loading

CaptainTrojan Jun 9, 2024 •

edited

Loading

github-actions bot commented Jun 12, 2024 •

edited

Loading