How correctly format the dataset to train a llama2 alpaca fine tuned model?

237 views Asked by celsowm At 27 November 2023 at 17:36

I would like to fine tunning a llama2-alpaca model called bode.

I have a web-scrapped dataset of questions and answers and I would like to use it on SFTTrainer to fine tunning that model to this specific domain but I don't know how correctly format the dataset to this model because on hugging face documentation is something like this:

<s>[INST] <<SYS>>
{{ system_prompt }}
<</SYS>>

{{ user_message }} [/INST]

But on this very model datacard, they suggest something like:

Abaixo está uma instrução que descreve uma tarefa. Escreva uma resposta que complete adequadamente o pedido.

### Instrução:
{instruction}

### Resposta:"""

So, is there a method to get via API the prompt used? If so, what kind of modification do I need to do pass it to SFTTrainer?

Original Q&A

TechQA.

How correctly format the dataset to train a llama2 alpaca fine tuned model?

There are 0 answers

Related Questions in HUGGINGFACE-TRANSFORMERS

Related Questions in LLAMA

Related Questions in ALPACA

Popular Questions

Popular Tags

Trending Questions