PixelControl

class PixelControl(feature, behavior_policy: pandemonium.policies.policy.Policy, channels: int = 32, kernel: int = 4, stride: int = 2, **kwargs)

Bases: pandemonium.implementations.rainbow.DQN

Dueling deconvolutional network for the auxiliary pixel control task.
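The two ideas named in the summary can be illustrated with a small, dependency-free sketch. It is not taken from the pandemonium source. It assumes the standard transposed-convolution size formula (no dilation, no output padding) and the mean-subtracted dueling aggregation Q = V + A - mean(A). The helper names `deconv_output_size` and `dueling_q` and the 9x9 input size are hypothetical, chosen only for illustration; `kernel=4` and `stride=2` mirror the defaults in the signature above.

```python
from statistics import fmean
from typing import List


def deconv_output_size(size: int, kernel: int = 4, stride: int = 2,
                       padding: int = 0) -> int:
    """Side length after a transposed convolution (no dilation, no output padding)."""
    return (size - 1) * stride - 2 * padding + kernel


def dueling_q(value: float, advantages: List[float]) -> List[float]:
    """Combine one spatial cell's value with per-action advantages.

    Uses the mean-subtracted form Q = V + A - mean(A); this is an assumed
    aggregation, not necessarily the one PixelControl implements.
    """
    mean_adv = fmean(advantages)
    return [value + a - mean_adv for a in advantages]


# Hypothetical 9x9 feature map upsampled with the default kernel and stride.
side = deconv_output_size(9)

# One spatial cell with a state value of 1.0 and three made-up action advantages.
q_cell = dueling_q(1.0, [0.5, -0.5, 0.0])
```

In a full network, the value stream would produce one such map and the advantage stream one map per action, with the aggregation applied independently at every spatial cell.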

References

Reinforcement Learning with Unsupervised Auxiliary Tasks (Jaderberg et al., 2016)