seldonian.utils.tutorial_utils.make_synthetic_regression_dataset

make_synthetic_regression_dataset(num_points, loc_X=0.0, loc_Y=0.0, sigma_X=1.0, sigma_Y=1.0, clipped=False, clip_min=-3, clip_max=3)

Generate a 2D dataset from random normal distributions, with optional clipping.

Parameters:
  • num_points (int) – The number of data points to generate

  • loc_X (float) – The mean of the normal distribution in the X dimension

  • loc_Y (float) – The mean of the normal distribution in the Y dimension

  • sigma_X (float) – The standard deviation of the normal distribution in the X dimension

  • sigma_Y (float) – The standard deviation of the normal distribution in the Y dimension

  • clipped (bool) – Whether to clip the generated values

  • clip_min – Clip values to be >= this value

  • clip_max – Clip values to be <= this value

Returns:

SupervisedDataSet object
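
The parameters above can be illustrated with a minimal standalone sketch. This is not the library's implementation: the helper name `sketch_synthetic_regression_data`, the `seed` argument, the independent sampling of X and Y, and clipping by clamping each value to [clip_min, clip_max] are all assumptions made for illustration. It also returns two plain lists rather than a SupervisedDataSet object, and uses only the standard library so it runs without seldonian installed.

```python
import random


def sketch_synthetic_regression_data(
    num_points,
    loc_X=0.0,
    loc_Y=0.0,
    sigma_X=1.0,
    sigma_Y=1.0,
    clipped=False,
    clip_min=-3,
    clip_max=3,
    seed=None,  # hypothetical argument, not part of the documented signature
):
    """Hypothetical stand-in mirroring the documented parameters."""
    rng = random.Random(seed)

    # Assumption: X and Y are drawn independently from their own normals.
    xs = [rng.gauss(loc_X, sigma_X) for _ in range(num_points)]
    ys = [rng.gauss(loc_Y, sigma_Y) for _ in range(num_points)]

    if clipped:
        # Assumption: clipping clamps each value into [clip_min, clip_max].
        xs = [min(max(x, clip_min), clip_max) for x in xs]
        ys = [min(max(y, clip_min), clip_max) for y in ys]

    return xs, ys


# Usage: 1000 points, clipped to the interval [-1, 1].
xs, ys = sketch_synthetic_regression_data(
    1000, clipped=True, clip_min=-1, clip_max=1, seed=0
)
```

With clipped=False the values are left as sampled, so clip_min and clip_max have no effect in this sketch.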