experiments.experiment_utils.generate_episodes_and_calc_J¶

generate_episodes_and_calc_J(**kwargs)¶

Calculate the expected discounted return by generating episodes

Returns:: (episodes, J), where episodes is the list of generated ground truth episodes and J is the expected discounted return
Return type:: (List(Episode),float)