feat: countgd sam2 video support #318

hrnn · 2024-12-05T13:41:28Z

Added countgd_sam2_video_tracking tool, to use countgd for object detection and pass the results to sam2 to track the objects in the entire video.

Link to Colab

Note to the reviewer:

I had to refactor a bit my initial solution due to mypy complains when using too generic parameters.

tests/integ/test_tools.py

dillonalaird

looks good! just some minor comments

dillonalaird · 2024-12-07T01:19:50Z

vision_agent/tools/tools.py

+    OWLV2 = "owlv2"
+
+
+def od_sam2_video_tracking(


vision_agent/tools/tools.py

dillonalaird · 2024-12-07T01:43:13Z

vision_agent/tools/tools.py

+    """'countgd_sam2_video_tracking' is a tool that can segment multiple objects given a text
+    prompt such as category names or referring expressions. The categories in the text
+    prompt are separated by commas. It returns a list of bounding boxes, label names,
+    mask file names and associated probability scores of 1.0.


minro comment, only florence2 returns probability scores of 1.0, countgd and owlv2 will can return regular probability scores. So you can just say "and associated probability scores."

dillonalaird · 2024-12-07T01:43:23Z

vision_agent/tools/tools.py

+    """'owlv2_sam2_video_tracking' is a tool that can segment multiple objects given a text
+    prompt such as category names or referring expressions. The categories in the text
+    prompt are separated by commas. It returns a list of bounding boxes, label names,
+    mask file names and associated probability scores of 1.0.


see comment above on prob scores

dillonalaird · 2024-12-07T01:44:56Z

vision_agent/tools/tools.py

+        List[Dict[str, Any]]: A list of dictionaries containing the score, label,
+            bounding box, and mask of the detected objects with normalized coordinates
+            (xmin, ymin, xmax, ymax). xmin and ymin are the coordinates of the top-left
+            and xmax and ymax are the coordinates of the bottom-right of the bounding box.
+            The mask is binary 2D numpy array where 1 indicates the object and 0 indicates
+            the background.
+
+    Example
+    -------
+        >>> countgd_sam2_video_tracking("car, dinosaur", image)
+        [
+            {
+                'score': 1.0,
+                'label': 'dinosaur',
+                'bbox': [0.1, 0.11, 0.35, 0.4],
+                'mask': array([[0, 0, 0, ..., 0, 0, 0],
+                    [0, 0, 0, ..., 0, 0, 0],
+                    ...,
+                    [0, 0, 0, ..., 0, 0, 0],
+                    [0, 0, 0, ..., 0, 0, 0]], dtype=uint8),
+            },
+        ]


Use the return values and examples from florence2_sam2_video_tracking. It's actually a list of list of dictionaries where the inner list is a frame

dillonalaird · 2024-12-07T01:45:09Z

vision_agent/tools/tools.py

+    Returns:
+        List[Dict[str, Any]]: A list of dictionaries containing the score, label,
+            bounding box, and mask of the detected objects with normalized coordinates
+            (xmin, ymin, xmax, ymax). xmin and ymin are the coordinates of the top-left
+            and xmax and ymax are the coordinates of the bottom-right of the bounding box.
+            The mask is binary 2D numpy array where 1 indicates the object and 0 indicates
+            the background.
+
+    Example
+    -------
+        >>> countgd_sam2_video_tracking("car, dinosaur", image)
+        [
+            {
+                'score': 1.0,
+                'label': 'dinosaur',
+                'bbox': [0.1, 0.11, 0.35, 0.4],
+                'mask': array([[0, 0, 0, ..., 0, 0, 0],
+                    [0, 0, 0, ..., 0, 0, 0],
+                    ...,
+                    [0, 0, 0, ..., 0, 0, 0],
+                    [0, 0, 0, ..., 0, 0, 0]], dtype=uint8),
+            },
+        ]


see comment above on return comments

dillonalaird

LGTM

hrnn added 4 commits December 4, 2024 23:29

feat: countgd sam2 video

fdfc38c

Merge branch 'main' into feat/countgd_sam2_video

519e414

added test

6bcd217

added test

50faa2e

hrnn self-assigned this Dec 5, 2024

hrnn added 5 commits December 5, 2024 11:20

handle empty chunk length

db40ef4

fixed mypy issues

e0a7cb2

updated docstring

984eaa3

allow multiple od tools

87a787c

added owlv2

07b4e19

hrnn commented Dec 6, 2024

View reviewed changes

tests/integ/test_tools.py Outdated Show resolved Hide resolved

fixed import

f193438

hrnn marked this pull request as ready for review December 6, 2024 16:14

hrnn requested review from hugohonda, dillonalaird, CamiloInx and camiloaz December 6, 2024 16:14

dillonalaird requested changes Dec 7, 2024

View reviewed changes

hrnn added 5 commits December 9, 2024 12:18

fixed probability

12062e0

fixed return example

1219fef

added florence2 support

36607d2

fixed mypy

c1320b8

merge from main

935e985

dillonalaird approved these changes Dec 11, 2024

View reviewed changes

hrnn merged commit 421dd1e into landing-ai:main Dec 13, 2024
8 checks passed

hrnn deleted the feat/countgd_sam2_video branch December 13, 2024 14:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: countgd sam2 video support #318

feat: countgd sam2 video support #318

hrnn commented Dec 5, 2024 •

edited

Loading

dillonalaird left a comment

dillonalaird Dec 7, 2024

dillonalaird Dec 7, 2024

hrnn Dec 9, 2024

dillonalaird Dec 7, 2024

hrnn Dec 9, 2024

dillonalaird Dec 7, 2024

hrnn Dec 9, 2024

dillonalaird Dec 7, 2024

hrnn Dec 9, 2024

dillonalaird left a comment

feat: countgd sam2 video support #318

feat: countgd sam2 video support #318

Conversation

hrnn commented Dec 5, 2024 • edited Loading

dillonalaird left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dillonalaird left a comment

Choose a reason for hiding this comment

hrnn commented Dec 5, 2024 •

edited

Loading