Runs a LinTS or non-contextual TS bandit experiment, given potential outcomes and covariates.
Arguments
- ys
Matrix. Potential outcomes of shape
[A, K]
, whereA
is the number of observations andK
is the number of arms. Must not contain NA values.- floor_start
Numeric. Specifies the initial value for the assignment probability floor. It ensures that at the start of the process, no assignment probability falls below this threshold. Must be a positive number.
- floor_decay
Numeric. Decay rate of the floor. The floor decays with the number of observations in the experiment such that at each point in time, the applied floor is:
floor_start/(s^{floor_decay})
, wheres
is the starting index for a batched experiment, or the observation index for an online experiment. Must be a number between 0 and 1 (inclusive).- batch_sizes
Integer vector. Size of each batch. Must be positive integers.
- xs
Optional matrix. Covariates of shape
[A, p]
, wherep
is the number of features, if the LinTSModel is contextual. Default isNULL
. Must not contain NA values.- balanced
Optional logical. Indicates whether to balance the batches. Default is
NULL
.