List Question
20 TechQA 2024-03-29T19:24:58.130000Understanding batching in pytorch models
35 views
Asked by Mahesha999
Using an upstream-downstream ML model, with the upstream being Wav2Vec 2.0 transformer and the downstream CNN. The model's accuracy is plateaued, why?
13 views
Asked by Fathima Hanan Parakkot
How to obtain latent vectors from fine-tuned model with transformers
17 views
Asked by Mario Alvarez
What is the difference between PEFT and RAFT?
21 views
Asked by Krishna
Improving Train Punctuality Prediction Using a Transformer Model: Model Setup and Performance Issues
13 views
Asked by dancing_rabbit_2442
How to remove layers in Huggingface's transformers GPT2 pre-trained models?
31 views
Asked by dark kk
NPL Keras transformers model not converging
19 views
Asked by ary soft
How to convert pretrained hugging face model to .pt and run it fully locally?
61 views
Asked by vonexel
LLaMA2 Workload Traces
14 views
Asked by Bipul Bikram Thapa
Inference question through LoRA in Whisper model
30 views
Asked by C yp
is there any way to use RL for decoder only models
11 views
Asked by rohit jindal
What's the exact input size in MultiHead-Attention of BERT?
16 views
Asked by TomWu
How to solve this error "UnsupportedOperation: fileno"
24 views
Asked by Deb
Transformers // Predicting next transaction based on sequence of previous transactions // Sequence2One task
20 views
Asked by Timofey_Zubashev
I was using colab: I want to run a .py file having argparse function to train a model
70 views
Asked by Wahab Al Labib
Feeding a Transformer with a matrix
14 views
Asked by DANI QS
nn.TransformerDecoder output the same result from the second frames
15 views
Asked by 阳铠行
I want to convert my TrOCR model into TFLite version
16 views
Asked by HAMZA MASSAR
Using the ENCODE function
14 views
Asked by THANH HOÀNG
Transformer for time series data
18 views
Asked by Oussama_rob