How is the discount factor taken into account in Stable Baselines3 on-policy methods, e.g. PPO?

47 views Asked by nrg At 26 October 2023 at 09:43

I would like to understand how gamma affects the learned policy. I cannot tell whether the final reward is discounted linearly or exponentially.

I would expect the final reward to be something like

R = sum_i gamma ^ (i) * rew_i

but I cannot find this in the main code. Thank you
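For reference, here is a minimal sketch of the sum above in plain Python, next to a backward recursion of the kind used for generalized advantage estimation (GAE). This is an illustration, not code from the Stable Baselines3 source: the function names `discounted_return` and `gae_advantages` are hypothetical, and the sketch assumes a single episode with no termination flags. As far as I understand, on-policy buffers typically accumulate the discount step by step in a loop like the second function, so the explicit `gamma ** i` sum may never appear literally in the code even though the discount is exponential.

```python
def discounted_return(rewards, gamma):
    """Direct form of the sum in the question: R = sum_i gamma**i * rew_i."""
    return sum(gamma ** i * r for i, r in enumerate(rewards))


def gae_advantages(rewards, values, last_value, gamma, gae_lambda):
    """Backward GAE recursion (hypothetical helper, single episode, no done flags).

    delta_t = r_t + gamma * V(s_{t+1}) - V(s_t)
    A_t     = delta_t + gamma * gae_lambda * A_{t+1}
    """
    n = len(rewards)
    advantages = [0.0] * n
    last_gae = 0.0
    for t in reversed(range(n)):
        # Bootstrap from last_value at the end of the rollout.
        next_value = last_value if t == n - 1 else values[t + 1]
        delta = rewards[t] + gamma * next_value - values[t]
        last_gae = delta + gamma * gae_lambda * last_gae
        advantages[t] = last_gae
    return advantages


rewards = [1.0, 1.0, 1.0]
gamma = 0.5

# Explicit exponential discount: 1 + 0.5 + 0.25
direct = discounted_return(rewards, gamma)

# With all value estimates set to zero and gae_lambda = 1, the recursion
# collapses to A_t = r_t + gamma * A_{t+1}, i.e. the discounted return.
recursive = gae_advantages(rewards, [0.0, 0.0, 0.0], 0.0, gamma, gae_lambda=1.0)

print(direct)        # 1.75
print(recursive[0])  # 1.75
```

In this toy setting the two agree, which suggests the discount is exponential in the step index: each step back in time multiplies the accumulated value by gamma once more. With nonzero value estimates and gae_lambda < 1, the recursion gives a bias/variance trade-off rather than the plain discounted sum.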


There are 0 answers
