I see that I have to define player observations to use QMIX + LSTM, as described here https://github.com/ray-project/ray/issues/8407#issuecomment-627401186 or as in this example https://github.com/ray-project/ray/blob/master/rllib/examples/two_step_game.py#L81. However, I don't understand what I should put into `ENV_STATE`.

Is this field for the states a player may be in? Are there any restrictions on them? Are they connected with the observations (the `obs` field next to it) in any way?
`ENV_STATE` represents the dimension of the environment state, and `obs` represents the dimension of the observations. However, it will not magically work for any environment: you have to wrap your observations and the environment state in a dictionary, as in this example https://github.com/ray-project/ray/blob/1.11.1/rllib/examples/env/two_step_game.py#L85, so that your environment returns it after every step and on `reset()`.
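For concreteness, here is a minimal sketch of such an environment. The env name and the sizes (a 5-dim per-agent observation, a 10-dim global state) are made up for illustration, and the imports assume Ray 1.x with gym-style spaces, where `ENV_STATE` (the string `"state"`) lives in `ray.rllib.env.multi_agent_env`:

```python
import numpy as np
from gym.spaces import Box, Dict, Discrete
from ray.rllib.env.multi_agent_env import ENV_STATE, MultiAgentEnv


class MyTwoAgentEnv(MultiAgentEnv):
    """Toy env whose per-agent observation is a dict of "obs" + ENV_STATE."""

    def __init__(self, env_config=None):
        super().__init__()
        self.agents = ["agent_1", "agent_2"]
        # Per-agent observation is 5-dim, the shared global state is 10-dim
        # (both sizes are made up for this sketch).
        self.observation_space = Dict({
            "obs": Box(-1.0, 1.0, shape=(5,)),
            ENV_STATE: Box(-1.0, 1.0, shape=(10,)),
        })
        self.action_space = Discrete(2)

    def _all_obs(self):
        state = np.zeros(10, dtype=np.float32)  # the global env state
        return {
            agent: {
                "obs": np.zeros(5, dtype=np.float32),  # this agent's own view
                ENV_STATE: state,  # the same global state for every agent
            }
            for agent in self.agents
        }

    def reset(self):
        # The dict observation must also be returned on reset().
        return self._all_obs()

    def step(self, action_dict):
        obs = self._all_obs()
        rewards = {agent: 0.0 for agent in self.agents}
        dones = {"__all__": True}
        return obs, rewards, dones, {}
```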
After that, you can use `with_agent_groups`.
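A sketch of that step, reusing the hypothetical env above; the group name is arbitrary, and the `Tuple`-over-members pattern for the group spaces follows the two_step_game example:

```python
from gym.spaces import Tuple
from ray.tune.registry import register_env

env = MyTwoAgentEnv()
grouping = {"group_1": ["agent_1", "agent_2"]}

# QMIX trains one group; its obs/act spaces are Tuples over the members.
register_env(
    "grouped_two_agent_env",
    lambda config: MyTwoAgentEnv(config).with_agent_groups(
        grouping,
        obs_space=Tuple([env.observation_space, env.observation_space]),
        act_space=Tuple([env.action_space, env.action_space]),
    ),
)
```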
As you can see from the QMIX sources, you can also define action masks in the same dictionary: https://github.com/ray-project/ray/blob/1.11.1/rllib/agents/qmix/qmix_policy.py#L93
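Sketching what that could look like for the toy spaces above; `"action_mask"` is the key the linked `qmix_policy.py` unpacks, and the sizes are the same made-up ones as before:

```python
import numpy as np
from gym.spaces import Box, Dict
from ray.rllib.env.multi_agent_env import ENV_STATE

N_ACTIONS = 2  # matches the Discrete(2) action space in the sketch above

# Per-agent observation space extended with an "action_mask" entry.
observation_space = Dict({
    "obs": Box(-1.0, 1.0, shape=(5,)),
    ENV_STATE: Box(-1.0, 1.0, shape=(10,)),
    "action_mask": Box(0.0, 1.0, shape=(N_ACTIONS,)),
})

# A single agent's observation then carries the mask as well: here only
# action 0 is available this step (1.0 = allowed, 0.0 = masked out).
one_agent_obs = {
    "obs": np.zeros(5, dtype=np.float32),
    ENV_STATE: np.zeros(10, dtype=np.float32),
    "action_mask": np.array([1.0, 0.0], dtype=np.float32),
}
```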