Help for package LogRegEquiv

Version:

0.1.5

Date:

2022-02-21

Title:

Logistic Regression Equivalence

Description:

Tools for assessing equivalence of similar Logistic Regression models.

Author:

Guy Ashiri-Prossner

Maintainer:

Guy Ashiri-Prossner <guy.ashiri@mail.huji.ac.il>

Encoding:

UTF-8

RoxygenNote:

7.1.1

License:

MIT + file LICENSE

Imports:

stats

Suggests:

knitr, rmarkdown, testthat

VignetteBuilder:

knitr

Depends:

R (≥ 2.10)

LazyData:

true

NeedsCompilation:

Packaged:

2022-02-21 15:10:09 UTC; guy

Repository:

CRAN

Date/Publication:

2022-02-21 15:40:02 UTC

beta_equivalence function

Description

This function takes two logistic regression models M_A, M_B, sensitivity level \delta_\beta and significance level \alpha. It checks whether the coefficient vectors are equivalent.

Usage

beta_equivalence(model_a, model_b, delta, alpha)

Arguments

model_a

logistic regression model M_A

model_b

logistic regression model M_B

delta

equivalence sensitivity level \delta_\beta. This could either be a scalar or a vector with length matching the number of coefficients.

alpha

significance level \alpha

Value

equivalence: are the coefficient vectors equivalent? (boolean)
test_statistic: Equivalence test statistic
critical value: a level-\alpha critical value
ncp: non-centrality parameter
p_value: P-value

brier_score function

Description

This function takes a observations vector y and matching predictions vector \pi. It returns the Brier score for the predictions. Unless specified otherwise, input containing NAs will result with an NA.

Usage

brier_score(y, pi, na.rm = FALSE)

Arguments

y

the obsrevations vector

pi

the predictions vector

na.rm

ignore NA? (optional)

Value

The Brier score \frac{1}{N}\sum_{i=1}^{N}{(y_i-\pi_i)^2}

Examples

brier_score(rbinom(10,1,seq(0.1, 1, 0.1)), seq(0.1, 1, 0.1))

descriptive_equiv function

Description

This function takes two datasets X_A, X_B, regression formula, significance level \alpha and sensitivity level \delta_\beta (either vector or scalar). It builds a logistic regression model for each of the datasets and then checks whether the obtained coefficient vectors are equivalent, using the beta_equivalence function.

Usage

descriptive_equiv(data_a, data_b, formula, delta, alpha = 0.05)

Arguments

data_a

dataset X_A for model M_A

data_b

dataset X_B for model M_B

formula

logistic regression formula

delta

equivalence sensitivity level \delta_\beta

alpha

significance level \alpha (defaults to 0.05)

Value

equivalence: the beta_equivalence function output
model_a: logistic regression model M_A
model_b: logistic regression model M_B

individual_predictive_equiv function

Description

This function takes two logistic regression models M_A, M_B, test data, significance level \alpha and allowed flips ratio r. It checks whether the models produce equivalent log-odds for the given test set and returns various figures.

Usage

individual_predictive_equiv(model_a, model_b, test_data, r = 0.1, alpha = 0.05)

Arguments

model_a

logistic regression model M_A

model_b

logistic regression model M_B

test_data

testing dataset

r

ratio of allowed 'flips' (defaults to 0.1)

alpha

significance level \alpha (defaults to 0.05)

Value

equivalence: Are models M_A,M_B producing equivalent log-odds for the given test data? (boolean)
test_statistic: The test statistic
critical_value: a level-\alpha critical value the test
xi_bar: Mean \xi value for the test
delta_theta: Calculated equivalence parameter
p_value: P-value

performance_equiv function

Description

This function takes two logistic regression models M_A, M_B, test data, significance level \alpha and acceptable score degradation \delta_B. It checks whether the models perform equivalently on the test set and returns various figures.

Usage

performance_equiv(
  model_a,
  model_b,
  test_data,
  dv_index,
  delta_B = 1.1,
  alpha = 0.05
)

Arguments

model_a

logistic regression model M_A

model_b

logistic regression model M_B

test_data

testing dataset

dv_index

column number of the dependent variable

delta_B

acceptable score degradation (defaults to 1.1)

alpha

significance level \alpha (defaults to 0.05)

Value

equivalence: Are models M_A,M_B producing equivalent Brier scores for the given test data? (boolean)
brier_score_ac: M_A Brier score on the testing data
brier_score_bc: M_B Brier score on the testing data
diff_sd_l: SD of the lower Brier difference BS^A-\delta_B^2BS^B
diff_sd_u: SD of the upper Brier difference BS^A-\delta_B^{-2}BS^B
test_stat_l: t_L equivalence boundary for the test
test_stat_u: t_U equivalence boundary for the test
crit_val: a level-\alpha critical value for the test
delta_B: Calculated equivalence parameter
p_value_l: P-value for t_L
p_value_u: P-value for t_U

Student Performance Data Set

Description

Data from a student achievement in secondary education of two Portuguese schools. Full attribute description could be found in the source webpage.

Usage

ptg_stud_data

Format

An object of class data.frame with 649 rows and 31 columns.

Details

The data used is taken from the Student Performance Data. The original data consists of 30 covariates (13 binary, 11 ordinal, 4 categorical, 2 numerical) and a numerical output variable indicating the students final grade in Portuguese Language course.

The data was split by gender (F/M) n_f=383, n_m=266. The target variable G3 was converted to binary, final_fail which indicates the cases where G3 < 10.

This function takes a number \theta and returns its respective sigmoid probability \frac{e^{theta}}{1+e^{theta}}. This is used in logistic regression to model P(y=1|x).

Usage

sigmoid(theta)

Arguments

theta

the linear predictor

Value

the sigmoid probability

Examples

sigmoid(0)

beta_equivalence function

Description

Usage

Arguments

Value

brier_score function

Description

Usage

Arguments

Value

Examples

descriptive_equiv function

Description

Usage

Arguments

Value

individual_predictive_equiv function

Description

Usage

Arguments

Value

performance_equiv function

Description

Usage

Arguments

Value

Student Performance Data Set

Description

Usage

Format

Details

Source

References

See Also

Student Performance Data Set - female testing data

Description

Usage

Format

See Also

Student Performance Data Set - female training data

Description

Usage

Format

See Also

Student Performance Data Set - male testing data

Description

Usage

Format

See Also

Student Performance Data Set - male training data

Description

Usage

Format

See Also

Sigmoid function

Description

Usage

Arguments

Value

Examples