Skip to content
This repository has been archived by the owner on Aug 1, 2024. It is now read-only.

Did you ever train with the ViTC? #63

Open
cvg25 opened this issue May 30, 2024 · 0 comments
Open

Did you ever train with the ViTC? #63

cvg25 opened this issue May 30, 2024 · 0 comments

Comments

@cvg25
Copy link

cvg25 commented May 30, 2024

Hey,
Thanks for sharing such a great work. Digging into the code, I've noticed that there is a ConvEmbed layer from this paper implemented in the vision transformer. I was wondering whether you had a chance to train with this layer, taking into account that masking is not as straightforward as with the common PatchEmbed.
Thanks,
Carlos

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant