Is Q-Learning Algorithm's implementation recursive?

Question

940 views Asked by dariush At 04 December 2014 at 11:44

I am trying to implement Q-learning. The general algorithm, taken from here, is shown below:

[Image: Q-learning algorithm pseudo-code]

In the statement

[Image: the Q-value update statement]

I don't understand whether I should implement this statement from the original pseudo-code recursively, expanding every next state that the current state/action pair can lead to and taking the max each time,

OR just take the maximum value stored in the state-action Q-value table for the next state reached with the current action.

Thanks in advance.

Original Q&A

There is 1 answer

**Don Reba** · Accepted Answer · 2014-12-04T11:58:50+00:00

All the formula says is that at step t+1 you update the state-action value using its value from step t and the maximum of the stored values over all actions for the state you are now in (the next state). That maximum is a lookup in the Q-value table, not a recursive call.
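To make the non-recursive reading concrete, here is a minimal sketch of a single tabular update in Python. It assumes the standard Q-learning rule with a learning rate `alpha` and a discount factor `gamma`; the function name `q_update`, the dictionary-based table, and the state and action labels are all hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def q_update(Q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update in place and return the new value."""
    # Best value currently stored for the next state: a plain max over
    # table entries, with no recursion into further states.
    best_next = max(Q[(next_state, a)] for a in actions)
    # Target built from the observed reward and the discounted lookup.
    td_target = reward + gamma * best_next
    # Move the old estimate a fraction alpha toward the target.
    Q[(state, action)] += alpha * (td_target - Q[(state, action)])
    return Q[(state, action)]


# Hypothetical two-action problem; unseen entries default to 0.0.
Q = defaultdict(float)
actions = ["left", "right"]
Q[("s1", "right")] = 2.0

new_value = q_update(Q, "s0", "right", reward=1.0, next_state="s1",
                     actions=actions, alpha=0.5, gamma=0.9)
print(new_value)
```

Each call touches exactly one table entry, `Q[(state, action)]`, and reads only the entries for `next_state`. Information from later states reaches earlier ones over repeated updates as the agent keeps interacting with the environment, not through a recursive expansion inside one update.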
