wk-libs: Add Dataset.add_layer_from_images() using pims #741

jstriebel · 2022-06-01T12:16:30Z

Description:

Adds Dataset.add_layer_from_images(), which converts plenty of image (stack) formats to a wk-compatible layer.

For the usage, please check the added test. This can also be executed standalone, which will upload the resulting datasets to the webknossos instance configured in your .env, I'd recommend to use a local wk instance. This helps to inspect the outputs manually.

I moved the respective testfiles to the webknossos package and linked them from wkcuber, since changes in webknossos also trigger CI runs for wkcuber, but not the other way around.

I also adapted the warning behaviour for multiprocessing the in the cluster-tools. Should we also adapt this for the other executors? Might be out-of-scope for this PR though.

Issues:

fixes Add create_from_image_sequence #632

Todos:

Updated Changelog
Added / Updated Tests
Updated Documentation
Considered adding this to the Examples
Add issue with detailed comparison to wkcuber conversion features (e.g. offset is currently missing, maybe some specialized image formats): Expand add_layer_from_images to cubing script #748

jstriebel · 2022-06-01T12:38:20Z

@philippotto I guess this is ready for review now. Some minor parts are still missing, see TODO, but test + code is ready. Let me know if we should do a walkthrough together. I'd also be very happy if you have a good idea how to simplify the logic in pims_images.py.

jstriebel · 2022-06-01T13:01:33Z

PS: The new tests add ~5 minutes. Probably we should look into parallelizing tests or reducing the time somehow. The downloaded datasets might also be cached in some folder (locally gitignored in the repo and via extra caching steps in the CI).

philippotto

Great stuff! I already reviewed most of the PR and left some feedback, but maybe we can have a call about the pims_images.py module, as I think I could use some kind of introduction before reviewing it :)

webknossos/webknossos/dataset/view.py

webknossos/webknossos/dataset/dataset.py

philippotto · 2022-06-02T09:49:54Z

webknossos/webknossos/dataset/dataset.py

+            # if a slice is larger than the others it might happen that
+            # a new chunk is partially written, leading to a warning,
+            # which is ignored in this context


wouldn't this even happen if all slices have the same size but that size is not shard-aligned? also, why is the warning hidden? the user could set compress=False to avoid the performance penalty, no?

I think in the first case this should work, since the first write will set the new bounding-box, which then fits all subsequent writes. Writes must be either shard or bounding-box aligned (per bbox-border). I think this warning is not useful at all in this case, since it only happens at the dataset-borders, where the user can not change much. The only important warning is about the different sizes, which is handled elsewhere. The performance penalty should be negligible, since it's only about the borders of the dataset. It's not that the general chunk-size doesn't fit the blocks during iteration.

ok, please explain this in the code comment too then :)

I expanded the comment, I hope it makes more sense now 👍

webknossos/webknossos/dataset/dataset.py

webknossos/testdata/tiff_with_different_dimensions/3.tif

webknossos/webknossos/dataset/_utils/pims_images.py

philippotto

awesome stuff! didn't look at the tests, yet, but I already have some feedback :)

webknossos/webknossos/dataset/_utils/pims_images.py

jstriebel · 2022-06-08T11:07:04Z

@philippotto Thanks a lot for the review! I think I adressed all points, either in the newest commits or commenting there directly. Please check my newest changes again, thanks 🙏

webknossos/webknossos/dataset/dataset.py

jstriebel · 2022-06-10T10:07:29Z

ping @philippotto

philippotto

Sorry for the late review! I only left some smaller comments (mostly about code comments) :)

webknossos/webknossos/dataset/_utils/pims_images.py

webknossos/webknossos/dataset/dataset.py

philippotto · 2022-06-13T13:54:18Z

webknossos/webknossos/dataset/dataset.py

+            # if a slice is larger than the others it might happen that
+            # a new chunk is partially written, leading to a warning,
+            # which is ignored in this context


ok, please explain this in the code comment too then :)

webknossos/tests/test_from_images.py

Co-authored-by: Philipp Otto <[email protected]>

jstriebel added 2 commits June 1, 2022 14:12

add Dataset.add_layer_from_images using pims

d135743

Merge remote-tracking branch 'origin/master' into add-layer-from-images

abfc508

jstriebel self-assigned this Jun 1, 2022

jstriebel added 2 commits June 1, 2022 14:20

add Changelog entries

2281b9f

formatting & types

19ce244

jstriebel requested a review from philippotto June 1, 2022 12:38

CI: install extras

cadd048

wkcuber-docker: add webknossos testdata mount

6b826f2

philippotto requested changes Jun 2, 2022

View reviewed changes

philippotto requested changes Jun 7, 2022

View reviewed changes

jstriebel added 3 commits June 7, 2022 17:44

apply PR feedback part 1

8aff5e7

apply PR feedback

ffeb61c

Merge remote-tracking branch 'origin/master' into add-layer-from-images

f8c66c4

jstriebel commented Jun 8, 2022

View reviewed changes

webknossos/webknossos/dataset/dataset.py Outdated Show resolved Hide resolved

calc largest_segment_id

d716019

This was referenced Jun 9, 2022

auto-detect conversion should allow --pad #729

Closed

Allow live compression for tiled cubing #44

Open

Merge branch 'master' into add-layer-from-images

34eeb0d

jstriebel requested a review from philippotto June 10, 2022 10:08

jstriebel mentioned this pull request Jun 13, 2022

Expand add_layer_from_images to cubing script #748

Closed

philippotto approved these changes Jun 13, 2022

View reviewed changes

jstriebel and others added 5 commits June 13, 2022 18:06

Update webknossos/webknossos/dataset/_utils/pims_images.py

5c0307e

Co-authored-by: Philipp Otto <[email protected]>

Update webknossos/webknossos/dataset/_utils/pims_images.py

3f5ccd0

Co-authored-by: Philipp Otto <[email protected]>

Merge branch 'master' into add-layer-from-images

0463264

Merge branch 'master' into add-layer-from-images

420d7d2

apply PR feedback

652a4b0

jstriebel enabled auto-merge (squash) June 22, 2022 16:08

jstriebel merged commit 9b2955a into master Jun 22, 2022

jstriebel deleted the add-layer-from-images branch June 22, 2022 16:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wk-libs: Add Dataset.add_layer_from_images() using pims #741

wk-libs: Add Dataset.add_layer_from_images() using pims #741

jstriebel commented Jun 1, 2022 •

edited

Loading

jstriebel commented Jun 1, 2022

jstriebel commented Jun 1, 2022

philippotto left a comment

philippotto Jun 2, 2022

jstriebel Jun 7, 2022

philippotto Jun 13, 2022

jstriebel Jun 22, 2022

philippotto left a comment

jstriebel commented Jun 8, 2022

jstriebel commented Jun 10, 2022

philippotto left a comment

philippotto Jun 13, 2022

wk-libs: Add Dataset.add_layer_from_images() using pims #741

wk-libs: Add Dataset.add_layer_from_images() using pims #741

Conversation

jstriebel commented Jun 1, 2022 • edited Loading

Description:

Issues:

Todos:

jstriebel commented Jun 1, 2022

jstriebel commented Jun 1, 2022

philippotto left a comment

Choose a reason for hiding this comment

philippotto Jun 2, 2022

Choose a reason for hiding this comment

jstriebel Jun 7, 2022

Choose a reason for hiding this comment

philippotto Jun 13, 2022

Choose a reason for hiding this comment

jstriebel Jun 22, 2022

Choose a reason for hiding this comment

philippotto left a comment

Choose a reason for hiding this comment

jstriebel commented Jun 8, 2022

jstriebel commented Jun 10, 2022

philippotto left a comment

Choose a reason for hiding this comment

philippotto Jun 13, 2022

Choose a reason for hiding this comment

jstriebel commented Jun 1, 2022 •

edited

Loading