TechQA.

Question

score 31 · Answer 1 · 2024-03-28T18:58:17.310000

Getting a Memory Out Error while Multiplying two 4D tensors with shape (1, 4, 2097152, 32)

31 views Asked by Oshan Devinda At 28 March 2024 at 18:58

score 26 · Answer 2 · 2024-03-25T20:38:32.733000

Custom patch embedding layer for pre-trained Vision transformers

26 views Asked by paper At 25 March 2024 at 20:38

score 17 · Answer 3 · 2024-03-11T07:40:43.107000

Constant Accuracy in Swin Transformer Training: Why is accuracy not improving?

17 views Asked by Pranav Dubal At 11 March 2024 at 07:40

score 31 · Answer 4 · 2024-03-11T00:05:50.687000

This code runs perfectly but I wonder what the parameter 'x' in my_forward function refers to

31 views Asked by Mohammad Elghandour At 11 March 2024 at 00:05

score 85 · Answer 5 · 2024-03-09T22:24:29.583000

Visualizing attention maps in a ViT transformer

85 views Asked by Mohammad Elghandour At 09 March 2024 at 22:24

score 97 · Answer 6 · 2024-03-06T08:56:25.420000

module 'torchvision.models' has no attribute 'ViT_B_16_Weights'

97 views Asked by Pranav Dubal At 06 March 2024 at 08:56

score 29 · Answer 7 · 2024-03-05T11:04:52.010000

How to patch intermediate layers of a python keras model with monkey patching?

29 views Asked by DROS At 05 March 2024 at 11:04

score 97 · Answer 8 · 2024-03-04T05:08:35.077000

Vision Transformer (ViT) implementation in PyTorch keeps returning same class label in output tensors

97 views Asked by Sarim At 04 March 2024 at 05:08

score 33 · Answer 9 · 2024-03-04T03:28:36.337000

Error mat1 and mat2 shapes cannot be multiplied (30x50176 and 768x768) in Vision Transformer from scratch PyTorch

33 views Asked by Fuji At 04 March 2024 at 03:28

score 84 · Answer 10 · 2024-02-26T14:13:43.427000

How is it possible to use a pre-trained ViT backbone of a masked autoencoder in downstream tasks?

84 views Asked by triggerp420 At 26 February 2024 at 14:13

score 130 · Answer 11 · 2024-02-18T19:40:57.893000

Why do my predicted output tensors always return same class label? (have fairly balanced dataset, assuming it's something to do with my class weights)

score 116 · Answer 12 · 2024-02-18T04:04:12.730000

How do I calculate the accuracy of my Vision Transformer?

116 views Asked by Sarim At 18 February 2024 at 04:04

score 58 · Answer 13 · 2024-01-12T17:22:19.207000

Is it possible to output a specific size of tensors in 'pixel_values' with a transform using HF's Dataset class?

58 views Asked by Killer Potato At 12 January 2024 at 17:22

score 84 · Answer 14 · 2023-12-20T01:25:40.160000

Image transformer model for image inpainting not converging on FashionMNIST

84 views Asked by Leon At 20 December 2023 at 01:25

score 219 · Answer 15 · 2023-12-15T22:28:17.553000

Run onnx model inference with FastAPI

219 views Asked by chipauris At 15 December 2023 at 22:28

score 118 · Answer 16 · 2023-12-14T09:49:46.323000

I want to change my ViT from single-label multi-class classification to multi-label. How should I rewrite the evaluation and loss sections?

118 views Asked by Jade Roy At 14 December 2023 at 09:49

score 91 · Answer 17 · 2023-12-09T14:09:22.067000

How can I define reconstruction validation in masked point cloud neural networks?

91 views Asked by dimes At 09 December 2023 at 14:09

score 145 · Answer 18 · 2023-11-29T12:38:59.337000

Grayscale images not loading using Hugging Face and ViT

145 views Asked by HAMID_Ullah At 29 November 2023 at 12:38

score 105 · Answer 19 · 2023-11-14T11:08:18.543000

Transformer augmented cGAN

105 views Asked by DAMANDEEP SINGH At 14 November 2023 at 11:08

score 246 · Answer 20 · 2023-10-09T23:50:06.877000

TypeError: Object of type ViTConfig is not JSON serializable when pushing a custom ViT model to Hugging Face Hub

246 views Asked by Reza At 09 October 2023 at 23:50
