R/cor_tables.R
pcor_matrix.RdThis returns a matrix aligned with the cor_matrix() function after parceling
out the effect of one or more other variables. This function requires complete data;
incomplete cases are dropped with a warning.
pcor_matrix(data, given, ...)Dataframe. Only numeric variables are included into correlation matrix.
A character vector with one or multiple variables in data. It/they will be parceled out from all other variables in data, before the correlation table is calculated.
Arguments passed on to cor_matrix
var_namesA named character vector with new variable names or a tibble as provided by get_rename_tribbles()
for variables. If NULL, then the variables are not renamed. If names are provided, only the variables included here are retained.
This is most helpful when the results are passed to some print function, such as report_cor_table()
conf_levelConfidence level to use for confidence intervals, defaults to .95
bootstrapWhen using FIML estimation (with missing = "fiml"), significance tests and confidence intervals can be bootstrapped. If you want to do that, pass the number of desired bootstrap resamples (e.g., 5000) to this parameter, but beware that this can take a while.
seedPass an integer to set the seed for bootstrapping and thus make this reproducible
methodmethod="pearson" is the default value. The alternatives to be passed to cor are "spearman" and "kendall". These last two are much slower, particularly for big data sets.
adjustWhat adjustment for multiple tests should be used? ("holm", "hochberg", "hommel", "bonferroni", "BH", "BY", "fdr", "none"). See p.adjust for details about why to use "holm" rather than "bonferroni").
# One might want to estimate correlations between health and possible predictors in the ESS
# after parceling out / controling for key demographic attributes:
pcor_matrix(ess_health, given = c("agea", "gndr"),
var_names = c("health" = "Health", "weight" = "Weight", "dosprt" = "Sport")) %>%
tidy(both_directions = FALSE)
#> Warning: Dropped 276 rows with missing data.
#> # A tibble: 3 × 9
#> column1 column2 estimate std.error p.value statistic n conf.low conf.high
#> <chr> <chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
#> 1 Weight Health 0.144 0.0119 1.23e-33 12.2 6950 0.121 0.167
#> 2 Sport Health -0.175 0.0118 3.64e-49 -14.9 6950 -0.198 -0.153
#> 3 Sport Weight -0.0546 0.0120 5.21e- 6 -4.56 6950 -0.0780 -0.0312