SkewedER

class SkewedER(*args, **kwargs)

Bases: pandemonium.experience.buffers.ER

Skewed Experience Replay buffer.

Used in the UNREAL architecture for the auxiliary reward prediction task.
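
As a rough illustration of what "skewed" means here: in the UNREAL paper, samples for the reward prediction task are drawn so that rewarding and non-rewarding experience are about equally represented, regardless of how rare rewards are in the stream. The sketch below shows that idea only and is not the pandemonium implementation. The class name `SkewedBufferSketch`, the two-bucket layout, and the `reward` argument are all assumptions made for the example, and `reward` is not meant to correspond to the `weight` argument of `add`.

```python
import random
from collections import deque


class SkewedBufferSketch:
    """Hypothetical two-bucket buffer; illustrative only."""

    def __init__(self, capacity=1000, seed=None):
        self.zero = deque(maxlen=capacity)     # transitions with zero reward
        self.nonzero = deque(maxlen=capacity)  # transitions with non-zero reward
        self.rng = random.Random(seed)

    def add(self, transition, reward):
        # Route each transition to a bucket by whether its reward is zero.
        (self.nonzero if reward != 0 else self.zero).append(transition)

    def sample(self, batch_size):
        if not self.zero and not self.nonzero:
            raise ValueError("cannot sample from an empty buffer")
        batch = []
        for _ in range(batch_size):
            # Pick a bucket with equal probability; fall back to the
            # other bucket if the chosen one is empty.
            first, second = self.zero, self.nonzero
            if self.rng.random() < 0.5:
                first, second = second, first
            bucket = first if first else second
            batch.append(self.rng.choice(bucket))
        return batch
```

With this layout, a stream in which only a few percent of transitions carry a reward still yields batches in which roughly half of the samples are rewarding.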

References

Reinforcement Learning with Unsupervised Auxiliary Tasks (Jaderberg et al., 2016)

Methods Summary

add(self, transition, weight)

sample(self, batch_size, contiguous)

Randomly draws a batch of transitions

Methods Documentation

add(self, transition: pandemonium.experience.experience.Transition, weight: float = None) → None

sample(self, batch_size: int = None, contiguous: bool = True) → List[pandemonium.experience.experience.Transition]

Randomly draws a batch of transitions

Parameters
  • batch_size – Number of transitions to sample from the buffer.

  • contiguous – Whether the sampled transitions should be contiguous, i.e. consecutive in the buffer. Contiguous sampling is particularly useful for \(n\)-step methods.
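
The difference between contiguous and non-contiguous sampling can be sketched on a plain list. This is a minimal example under stated assumptions, not the library's code: the function `sample_sketch`, its `storage` and `rng` arguments, and the choice to raise an error when the batch exceeds the buffer size are all hypothetical.

```python
import random


def sample_sketch(storage, batch_size, contiguous=True, rng=random):
    """Hypothetical helper: draw `batch_size` items from a list."""
    if batch_size > len(storage):
        raise ValueError("batch_size exceeds the number of stored items")
    if contiguous:
        # Pick a random start index and return a consecutive slice,
        # which preserves the temporal order n-step methods rely on.
        start = rng.randrange(len(storage) - batch_size + 1)
        return storage[start:start + batch_size]
    # Otherwise draw distinct items uniformly at random.
    return rng.sample(storage, batch_size)
```

A contiguous draw from `list(range(20))` with `batch_size=5` returns five consecutive integers, whereas a non-contiguous draw returns five distinct integers in arbitrary order.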