-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
suggest your papers here! #1
Comments
it will also come complete with conditioning, so it can be used straightforwardly in DDPMs |
"iUNets - Fully invertible U-Nets with Learnable Up- and Downsampling", https://arxiv.org/abs/2005.05220, https://github.com/cetmann/iunets “Mask2Former - Masked-attention Mask Transformer for Universal Image Segmentation", https://bowenc0221.github.io/mask2former/ Looking forward to the code! :-) |
Hi Micpie! 👋 Is Maskformer considered a unet?? I guess it does have downsamples and then upsample layers |
Hi Phil, yeah, Mask2Former is a little bit different from the standard UNet setup, which very likely makes it tricky to implement it in one abstraction (but I wanted to bring it up, because of the good results they achieved). This is another interesting approach: I'm pretty excited of mixing convs + attn for UNets due to https://github.com/xxxnell/how-do-vits-work I also recently rewrote your UNet code from https://github.com/lucidrains/denoising-diffusion-pytorch/blob/master/denoising_diffusion_pytorch/denoising_diffusion_pytorch.py#L180 to use it as "normal" UNet, but then had to postpone testing it due to other work coming up, but this is why I'm looking forward to your x-unet even more. :-) Cheers, |
@MicPie ok, let me figure out the main inductive bias behind maskformer and get back to you :) |
@MicPie are you using any u-nets for your AI medical imaging work? |
Yeah, I would say UNets are still the standard there. |
@MicPie yea, even the transformers side seems to converge on the same structure (https://arxiv.org/abs/2110.13711 and https://arxiv.org/abs/2202.10890) i'll see what i can do to push things forward :) edit: you are so well read, as usual! |
Thank you for sharing, I didn't know the "Hierarchical Perceiver" paper and the one about UNet variants. 🙏 UNet++ looks really fancy, the corresponding paper has also some nice figures with the setup: https://arxiv.org/abs/1807.10165 |
Hi @lucidrains, please check out our [CVPR2022-Oral] "MAXIM: Multi-Axis MLP for Image Processing", a multi-stage UNet with (standalone) multi-axis MLP layers. It is concurrent work with Restormer with similar performances on multiple low-level tasks. We provide Jax code here: https://github.com/google-research/maxim |
Imagen (needs no introduction) proposes some interesting improvements as a so-called "Efficient U-Net". Might be worth checking out Appendix B.1 for a summary. |
UNet++: A Nested U-Net Architecture for Medical Image Segmentation |
yup, i have it built over at https://github.com/lucidrains/imagen-pytorch and @nousr has successfully used it for his medical segmentation school work |
@vvvm23 i'll see if i can port over the logic in a clean manner |
great list! 🙏 |
I think this paper is worth looking at
|
https://github.com/3dim-paper/website |
Trans-Unet https://arxiv.org/abs/2102.04306
U^2-Net https://arxiv.org/abs/2005.09007
Restormer https://arxiv.org/abs/2111.09881
The text was updated successfully, but these errors were encountered: