reagent.gym.policies package

Submodules

reagent.gym.policies.policy module

class reagent.gym.policies.policy.Policy(scorer: Callable[[Any], Any], sampler: reagent.gym.types.Sampler)

Bases: object

act(obs: Any) → reagent.types.ActorOutput

Performs the composition described above. These are the actions being put into the replay buffer, not necessary the actions taken by the environment!

reagent.gym.policies.predictor_policies module

reagent.gym.policies.random_policies module

Module contents

class reagent.gym.policies.Policy(scorer: Callable[[Any], Any], sampler: reagent.gym.types.Sampler)

Bases: object

act(obs: Any) → reagent.types.ActorOutput

Performs the composition described above. These are the actions being put into the replay buffer, not necessary the actions taken by the environment!