RPB.ERE
Description
Emphasizing Recent Experiences
samplingRange
Arguments
Buffer Size N
Number of Epochs K
Current Epoch k
cMin
η
cK
Calculate ERE Sampling range cK
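As a rough illustration of how the arguments above combine, the sketch below assumes the sampling-range rule from the ERE paper, c_k = max(N · η^(1000·k/K), cMin). The name samplingRangeSketch, the argument order, and the Int/Float types are hypothetical and are not taken from this module's source.

```haskell
-- | Hypothetical sketch of the ERE sampling range (not the module's code).
-- Assumed rule: c_k = max (N * eta ^ (1000 * k / K)) cMin
samplingRangeSketch
  :: Int    -- ^ Buffer size N
  -> Int    -- ^ Number of epochs K (must be positive)
  -> Int    -- ^ Current epoch k
  -> Int    -- ^ Lower bound cMin
  -> Float  -- ^ Emphasis parameter eta, expected in (0, 1]
  -> Int    -- ^ Sampling range c_k
samplingRangeSketch n bigK k cMin eta = max cMin (floor shrunk)
  where
    -- The exponent grows with k, so the range shrinks within one update round.
    expo   = 1000 * fromIntegral k / fromIntegral bigK :: Float
    shrunk = fromIntegral n * eta ** expo :: Float
```

With k = 0 the exponent is zero and the whole buffer is eligible. As k approaches K the range contracts toward the most recent entries, and cMin keeps it from collapsing to a handful of transitions.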
sample :: Buffer Tensor -> Int -> Int -> Int -> Int -> Int -> Float -> IO (Buffer Tensor)
Sample from buffer within ERE range
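The idea of sampling within the ERE range can be sketched on a plain list, without the Buffer Tensor type. This is only an illustration under stated assumptions: the list is ordered oldest-first and non-empty, the uniform draws in [0, 1) are supplied by the caller, and sampleRecentSketch is a hypothetical name that does not appear in this module.

```haskell
-- | Hypothetical sketch: draw a mini-batch from the cK newest entries.
-- Assumes `buffer` is non-empty and ordered oldest-first, and that every
-- element of `draws` is a uniform number in [0, 1).
sampleRecentSketch
  :: Int       -- ^ Sampling range c_k
  -> [Double]  -- ^ One uniform draw per sample in the mini-batch
  -> [a]       -- ^ Buffer contents, oldest first
  -> [a]       -- ^ Sampled entries, all from the newest c_k items
sampleRecentSketch cK draws buffer = map pick draws
  where
    n      = length buffer
    window = max 1 (min cK n)           -- clamp the range to the buffer size
    recent = drop (n - window) buffer   -- the `window` newest entries
    -- Scale a draw in [0, 1) to an index in [0, window - 1].
    pick u = recent !! min (window - 1) (floor (u * fromIntegral window))
```

Passing the draws in explicitly keeps the sketch pure and free of extra dependencies. The documented sample runs in IO, which suggests it obtains its randomness internally.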
anneal
Initial η0
Final ηT
Horizon T
Current step t
Current ηt
ERE η Annealing during training
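A minimal sketch of the annealing step, assuming the linear schedule described in the ERE paper, η_t = η0 + (η_final − η0) · t/T. The name annealSketch, the argument types, and the clamping of t to the horizon are assumptions made for illustration and may differ from what anneal actually does.

```haskell
-- | Hypothetical sketch of linear eta annealing (not the module's code).
-- Assumed rule: eta_t = eta0 + (etaFinal - eta0) * t / T
annealSketch
  :: Float  -- ^ Initial eta0
  -> Float  -- ^ Final eta, reached at the horizon
  -> Int    -- ^ Horizon T (must be positive)
  -> Int    -- ^ Current step t
  -> Float  -- ^ Annealed eta at step t
annealSketch eta0 etaFinal horizon t = eta0 + (etaFinal - eta0) * frac
  where
    -- Assumption: steps past the horizon stay at the final value.
    frac = fromIntegral (min t horizon) / fromIntegral horizon
```

Annealing η toward 1 over training gradually moves sampling back toward uniform, since η = 1 makes the sampling range equal to the full buffer.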