DuellingMixin

class DuellingMixin

Bases: object

Mixin for value-based control algorithms that use two separate estimators, one for action-values and one for state-values.
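For orientation, here is a minimal sketch of the aggregation step described in the referenced paper by Wang et al., where a state-value estimate V(s) and per-action advantage estimates A(s, a) are combined as Q(s, a) = V(s) + (A(s, a) - mean over a' of A(s, a')). The function name `dueling_q_values` and its parameters are hypothetical and are not part of this class's API; how `DuellingMixin` itself wires its estimators together is not shown on this page.

```python
from typing import List, Sequence


def dueling_q_values(state_value: float, advantages: Sequence[float]) -> List[float]:
    """Combine V(s) and A(s, a) into Q(s, a) with mean-subtracted aggregation.

    Hypothetical helper for illustration only; not part of DuellingMixin.
    """
    if not advantages:
        raise ValueError("advantages must contain at least one action")
    # Subtracting the mean advantage makes the decomposition identifiable:
    # the resulting Q-values average to exactly V(s).
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + (a - mean_advantage) for a in advantages]


# One state with three actions.
q_values = dueling_q_values(state_value=1.0, advantages=[0.5, -0.5, 0.0])
print(q_values)
```

Subtracting the mean advantage is the variant the paper uses in its experiments; it also discusses subtracting the maximum advantage instead.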

References

“Dueling Network Architectures for Deep Reinforcement Learning” by Wang et al.

https://arxiv.org/pdf/1511.06581