DuellingMixin
Bases: object
Mixin for value-based control algorithms that uses two separate estimators, one for state values and one for action advantages, and combines them into action-value estimates.
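As a rough illustration of how two such estimates can be combined, the sketch below applies the mean-centred aggregation described in the referenced paper, Q(s, a) = V(s) + (A(s, a) - mean over a' of A(s, a')). The function name `dueling_q_values` and its parameters are hypothetical and are not part of this library's API; whether `DuellingMixin` uses this exact aggregation is an assumption.

```python
from statistics import fmean
from typing import List, Sequence


def dueling_q_values(state_value: float, advantages: Sequence[float]) -> List[float]:
    """Combine a state value V(s) and per-action advantages A(s, a) into Q(s, a).

    Hypothetical helper: subtracts the mean advantage so that the mean of the
    resulting Q-values equals V(s), which keeps the two estimates identifiable.
    """
    mean_advantage = fmean(advantages)
    return [state_value + (a - mean_advantage) for a in advantages]


# One state with three actions: V(s) = 2.0, advantages already centred on zero.
q = dueling_q_values(state_value=2.0, advantages=[1.0, 0.0, -1.0])
print(q)  # [3.0, 2.0, 1.0]
```

Centring the advantages means that adding a constant to every advantage leaves the Q-values unchanged, so the state-value estimator alone carries the overall level.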
References
Wang et al., "Dueling Network Architectures for Deep Reinforcement Learning": https://arxiv.org/pdf/1511.06581