For questions related to deep reinforcement learning (DRL), that is, reinforcement learning (RL) combined with deep learning. More precisely, deep neural networks are used to represent, for example, value functions or policies.
Questions tagged [deep-rl]
511 questions
2 votes, 2 answers
Is Bellman backup unbiased?
This comes from CS285 Fall 2023 HW3.
In my opinion, if $\hat{Q}$ is an unbiased estimate of $Q$, then
$$
\begin{align}
\mathbb{E}_{D \sim P}[B_{D}\hat{Q} - B_{D}Q]
&= \mathbb{E}_{D \sim P}[r(s,a) + \gamma \max_{a'}\hat{Q}(s', a') - r(s,a) - \gamma…
yeebo xie
1 vote, 2 answers
How to handle the dead agent in multi-agent environment?
I am trying to implement deep reinforcement learning on a defender-vs-attacker problem, where agents can be destroyed by enemies. I am coding both the environment and the RL algorithm. Each agent can observe its own state and the states of the others. As we know, the input…
zhixin