List Question
20 TechQA 2024-03-28T18:58:17.310000
Getting a Memory Out Error while Multiplying two 4D tensors with shape (1, 4, 2097152, 32)
31 views
Asked by Oshan Devinda
Custom patch embedding layer for pre-trained Vision transformers
26 views
Asked by paper
Constant Accuracy in Swin Transformer Training: Why is accuracy not improving?
17 views
Asked by Pranav Dubal
This code runs perfectly but I wonder what the parameter 'x' in my_forward function refers to
31 views
Asked by Mohammad Elghandour
Visualizing attention maps in a ViT transformer
85 views
Asked by Mohammad Elghandour
module 'torchvision.models' has no attribute 'ViT_B_16_Weights'
97 views
Asked by Pranav Dubal
How to patch intermediate layers of a python keras model with monkey patching?
29 views
Asked by DROS
How is it possible to use a pre-trained ViT backbone of a masked autoencoder in downstream tasks?
84 views
Asked by triggerp420
How do I calculate the accuracy of my Vision Transformer?
116 views
Asked by Sarim
Is it possible to output a specific size of tensors in 'pixel_values' with a transform using HF's Dataset class?
58 views
Asked by Killer Potato
Image transformer model for image inpainting not converging on FashionMNIST
84 views
Asked by Leon
Run ONNX model inference with FastAPI
219 views
Asked by chipauris
How can I define reconstruction validation in masked point cloud neural networks?
91 views
Asked by dimes
Grayscale images not loading using Hugging Face and ViT
145 views
Asked by HAMID_Ullah
Transformer augmented cGAN
105 views
Asked by DAMANDEEP SINGH