seldonian.RL.environments.mountaincar.Mountaincar¶

class Mountaincar¶

Bases: Environment

__init__()¶

Classic Mountaincar environment with hardcoded position and velocity bounds. Actions: -1,0,1 -> force left, no force, force right.

Variables:

env_description (Env_Description) – contains attributes describing the environment
terminal_state (bool) – Whether the terminal obs is occupied
time (int) – The current timestep
position (float) – The 1D physical position of the car, initialized at -0.5.
velocity (float) – The 1D velocity of the car, initialized at 0.0.
max_time (int) – Maximum allowed timestep

Methods

check_valid_mc_action(action)¶

Checks to ensure a valid action was taken.

create_env_description()¶

Creates the environment description object.

get_env_description()¶: Get environment description. Override this method in child class implementation

position_and_termination_update()¶: Update the position given the current velocity. Check to see if we have gone outside position bounds. Also check to see if we have reached the goal position.

transition(action)¶

Transition between states given an action, return a reward.

update_velocity(action)¶

Apply the velocity update rule

visualize()¶: Print out current observation, useful for debugging. Override this method in child class implementation

Seldonian Engine