Safe Haskell | None |
---|---|
Language | Haskell2010 |
Hindsight Experience Replay
Documentation
Hindsight Experience Replay Strategies for choosing Goals
Final | Only Final States are additional targets |
Random | Replay with |
Episode | Replay with |
Future | Replay with |
Instances
Eq Strategy Source # | |
Read Strategy Source # | |
Show Strategy Source # | |
Generic Strategy Source # | |
ToJSON Strategy Source # | |
FromJSON Strategy Source # | |
type Rep Strategy Source # | |
Defined in Lib type Rep Strategy = D1 ('MetaData "Strategy" "Lib" "acid-0.1.0.0-Kasfqr97FKi1MHFsFLmEZd" 'False) ((C1 ('MetaCons "Final" 'PrefixI 'False) (U1 :: Type -> Type) :+: C1 ('MetaCons "Random" 'PrefixI 'False) (U1 :: Type -> Type)) :+: (C1 ('MetaCons "Episode" 'PrefixI 'False) (U1 :: Type -> Type) :+: C1 ('MetaCons "Future" 'PrefixI 'False) (U1 :: Type -> Type))) |
Strict Simple/Naive Replay Buffer
Instances
Functor Buffer Source # | Hindsight Experience Replay Buffer is a implements |
Applicative Buffer Source # | This is badly defined and only so it can use |
ReplayBuffer Buffer Source # | Hindsight Experience Replay Buffer implements |
Defined in RPB.HER size :: Buffer Tensor -> Int Source # push :: Int -> Buffer Tensor -> Buffer Tensor -> Buffer Tensor Source # lookUp :: [Int] -> Buffer Tensor -> Buffer Tensor Source # sampleIO :: Int -> Buffer Tensor -> IO (Buffer Tensor) Source # asTuple :: Buffer Tensor -> (Tensor, Tensor, Tensor, Tensor, Tensor) Source # collectExperience :: Agent a => Params -> CircusUrl -> Tracker -> Int -> a -> IO (Buffer Tensor) Source # | |
Eq a => Eq (Buffer a) Source # | |
Show a => Show (Buffer a) Source # | |