Learning estimated value AND expected temporal-difference error

Asked by user3510164 on 13 March 2020 at 14:17 (32 views)

How can I best get my network to learn not only the expected value but also the expected variation around that value, as a measure of uncertainty? For any state the network has never seen before, this uncertainty should be very high; for any state the network has seen many times, it should approach an estimate of the actual expected variation.

I am wondering whether one can "learn" both quantities at the same time with a (potentially only partially) overlapping network.
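To make the idea concrete, here is a rough sketch of what I have in mind, not a tested method. It assumes a tabular stand-in for the two output heads: one table `v` for the value estimate and one table `u` that tracks the expected squared TD error as the uncertainty signal. The toy environment (one-step episodes with Gaussian rewards), the step sizes `ALPHA` and `BETA`, and the pessimistic initial value `U_INIT` are all assumptions made up for illustration. In a real network the two tables would become two heads on a shared trunk.

```python
import random

GAMMA = 0.9     # discount factor (only used for non-terminal transitions)
ALPHA = 0.01    # step size for the value estimate (assumed)
BETA = 0.01     # step size for the squared-TD-error estimate (assumed)
U_INIT = 100.0  # high initial uncertainty, so unseen states stay "very uncertain"


def td_update(v, u, s, r, s_next, terminal):
    """One TD(0) step that updates both the value and the uncertainty estimate."""
    target = r if terminal else r + GAMMA * v[s_next]
    delta = target - v[s]                   # temporal-difference error
    u[s] += BETA * (delta * delta - u[s])   # move u[s] toward the squared TD error
    v[s] += ALPHA * delta                   # standard TD(0) value update
    return delta


def run(steps=20000, seed=0):
    """Hypothetical toy problem: one-step episodes with state-dependent reward noise."""
    rng = random.Random(seed)
    # state -> (reward mean, reward standard deviation); state 2 is never visited
    reward_params = {0: (1.0, 0.1), 1: (-1.0, 2.0)}
    v = [0.0, 0.0, 0.0]
    u = [U_INIT, U_INIT, U_INIT]
    for _ in range(steps):
        s = rng.choice([0, 1])
        mean, std = reward_params[s]
        r = rng.gauss(mean, std)
        td_update(v, u, s, r, s_next=None, terminal=True)
    return v, u


if __name__ == "__main__":
    v, u = run()
    for s in range(3):
        print(f"state {s}: value={v[s]:.2f}  expected squared TD error={u[s]:.2f}")
```

In this sketch the never-visited state keeps its initial uncertainty `U_INIT`, while for visited states `u` drifts toward the variance of the reward noise. Whether a function approximator would keep the uncertainty high for genuinely unseen states is exactly the part I am unsure about, since generalization across a shared trunk could pull it down.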


There are 0 answers
