This method corrects the mean adjusted agreement by a permutation approach and generates the relation parameter mutual forest impact. Subsequently p-values are determined and related variables are selected.
Arguments
- variables
Vector of variable names for which related variables should be searched.
- candidates
Vector of variable names that are candidates to be related to the variables.
- num.threads
Number of threads to parallelize with. (Default: 1)
- ...
Arguments passed on to
RandomForestSurrogatesx,yPredictor data and dependent variables.
s.pct,sNumber of surrogate splits. This can be defined either by setting
s.pctto a number between 0 and 1, or providing an exact value fors.s.pct: Percentage of variables to use fors. (Default: 0.01)s: Number of surrogate splits. (Default: Number of variables multiplied bys.pct, which defaults to 0.01; Ifs.pctis less than or equal to zero, or greater than 1: 0.01 is used instead.)
mtryNumber of variables to possibly split at in each node. Default is the (rounded down) number of variables to the power of three quarters (Ishwaran, 2011). Alternatively, a single argument function returning an integer, given the number of independent variables.
typeThe type of random forest to create with ranger. One of
"regression"(Default),"classification"or"survival".statusIf
type = "regression": Survival forest status variable. Use 1 for event and 0 for censoring. Length must matchy.min.node.sizeMinimal node size to split at. (Default: 1)
permutateEnable to permutate
xforMutualForestImpact()(Default: FALSE).seedRNG seed. It is strongly recommended that you set this value.
preschedule.threads(Default: TRUE) Passed as
mc.prescheduletoparallel::mclapply()inaddSurrogates().num.treesNumber of trees.
Value
A MutualForestImpact() list object.
REL: TheMeanAdjustedAgreement()object.PERM: The permutatedMeanAdjustedAgreement()object.relations: Matrix of determined relations (rows: investigated variables, columns: candidate variables).
Examples
# \donttest{
data("SMD_example_data")
mfi <- MFI(
x = SMD_example_data[, -1], y = SMD_example_data[, 1],
s = 10, num.trees = 50, num.threads = 1,
variables = c("X7", "X1"), candidates = colnames(SMD_example_data)[2:101]
)
#> Warning: `seed` was not set. Your results may not be reproducible.
#> Warning: `seed` was not set. Your results may not be reproducible.
# }