experiments.experiment_utils.generate_episodes_and_calc_J
- generate_episodes_and_calc_J(**kwargs)
Calculate the expected discounted return J by generating episodes.
- Returns:
(episodes, J), where episodes is the list of generated ground-truth episodes and J is the expected discounted return.
- Return type:
(List[Episode], float)
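
The entry above does not document the accepted keyword arguments or how J is computed, so the following is only a minimal sketch of one common approach: a Monte Carlo estimate that averages the discounted return over sampled episodes. Everything in it is an assumption made for illustration, not the library's API: the name `sketch_generate_episodes_and_calc_J`, the parameters `sample_episode`, `n_episodes` and `gamma`, and the reduction of an `Episode` to a plain list of rewards.

```python
from typing import Callable, List, Tuple

# Hypothetical stand-in: an episode reduced to its sequence of rewards.
# The real Episode type in the library is not documented here.
Episode = List[float]


def discounted_return(rewards: Episode, gamma: float) -> float:
    """Return sum_t gamma**t * rewards[t], accumulated from the last step backwards."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def sketch_generate_episodes_and_calc_J(
    sample_episode: Callable[[], Episode],  # hypothetical episode sampler
    n_episodes: int,                        # hypothetical number of episodes to generate
    gamma: float,                           # hypothetical discount factor
) -> Tuple[List[Episode], float]:
    """Generate episodes and estimate J as the mean discounted return."""
    if n_episodes <= 0:
        raise ValueError("n_episodes must be positive")
    episodes = [sample_episode() for _ in range(n_episodes)]
    J = sum(discounted_return(ep, gamma) for ep in episodes) / n_episodes
    return episodes, J


if __name__ == "__main__":
    # Deterministic toy sampler: three steps with reward 1.0 each.
    episodes, J = sketch_generate_episodes_and_calc_J(
        lambda: [1.0, 1.0, 1.0], n_episodes=4, gamma=0.5
    )
    print(len(episodes), J)
```

The sketch returns the pair in the same order as the documented return value, (episodes, J), so the generated episodes can be reused without sampling again.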