Safe Haskell | None |
---|---|
Language | Haskell2010 |
Prioritized Experience Replay
Synopsis
- data Buffer a = Buffer {}
- mkBuffer :: Int -> Float -> Float -> Int -> Buffer Tensor
- empty :: Int -> Float -> Float -> Int -> Buffer Tensor
- push :: Buffer Tensor -> Tensor -> Tensor -> Tensor -> Tensor -> Tensor -> Buffer Tensor
- push' :: Buffer Tensor -> Buffer Tensor -> Buffer Tensor
- sampleIO :: Buffer Tensor -> Int -> Int -> IO (Buffer Tensor, Tensor, Tensor)
- update :: Buffer Tensor -> Tensor -> Tensor -> Buffer Tensor
- betaByFrame :: Float -> Int -> Int -> Float
Documentation
Strict Prioritized Experience Replay Buffer
push :: Buffer Tensor -> Tensor -> Tensor -> Tensor -> Tensor -> Tensor -> Buffer Tensor Source #
Push new memories in a Buffer
push' :: Buffer Tensor -> Buffer Tensor -> Buffer Tensor Source #
Syntactic Sugar for adding one buffer to another
sampleIO :: Buffer Tensor -> Int -> Int -> IO (Buffer Tensor, Tensor, Tensor) Source #
Take a prioritized sample from the Buffer