how do vision transformers deal with input images with different sizes?

330 views Asked by At

I want to train a vision transformer with progressive learning which is used in EffientNetV2. Is there any way to do this in a transformer model?

0

There are 0 answers