experiments.experiment_utils.generate_episodes_and_calc_J

generate_episodes_and_calc_J(**kwargs)

Calculate the expected discounted return J by generating episodes.

Returns:

(episodes, J), where episodes is the list of generated ground-truth episodes and J is the expected discounted return.

Return type:

(List[Episode], float)
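
The docstring does not say how J is obtained from the generated episodes. A common approach is a Monte Carlo average of per-episode discounted returns, and the sketch below illustrates that idea only. It is not the library's implementation: the helper names `discounted_return` and `estimate_J`, the discount factor `gamma`, and the representation of each episode as a plain sequence of rewards are all assumptions made for illustration; the real `Episode` type and the accepted `**kwargs` are not documented here.

```python
from typing import List, Sequence


def discounted_return(rewards: Sequence[float], gamma: float) -> float:
    """Return sum_t gamma**t * rewards[t] for a single episode (hypothetical helper)."""
    g = 0.0
    # Accumulate from the last reward backwards: g_t = r_t + gamma * g_{t+1}.
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def estimate_J(reward_sequences: List[Sequence[float]], gamma: float) -> float:
    """Average the discounted returns over all episodes (hypothetical helper).

    Assumes each episode is given as its sequence of rewards; the real
    Episode objects may store rewards differently.
    """
    if not reward_sequences:
        raise ValueError("need at least one episode to estimate J")
    returns = [discounted_return(rs, gamma) for rs in reward_sequences]
    return sum(returns) / len(returns)


if __name__ == "__main__":
    # Two toy episodes, written out by hand rather than generated.
    toy_episodes = [[1.0, 1.0, 1.0], [2.0]]
    print(estimate_J(toy_episodes, gamma=0.5))
```

With this kind of estimator, the average approaches the expected discounted return as the number of generated episodes grows, which is presumably why the function returns the episodes alongside J.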