Handbook

Overview

This is the API documentation for statistical hypothesis tests.

Student t-test (1 sample)

`fsml_ttest_1sample`

Description

The 1-sample t-test determines if the sample mean has the value specified in the null hypothesis.

The null hypothesis $H_{0}$ and alternative hypothesis $H_{1}$ can be written as: $H_{0}$ : $\bar{x} = \mu_0$ , and $H_{1}$ : $\bar{x} \neq \mu_0$

The optional argument h1 lets you specify your alternative hypothesis further as "greater than" (h1 = "gt"), "less than" (h1 = "lt"), or "two-sided" (h1 = "two").

The test statstic $t$ is calculated as follows: $t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}$ where $x$ (x) are sample observations, $\bar{x}$ is the sample mean, $s$ is the sample standard deviation, $n$ is the sample size, and $\mu_0$ (mu0) is the population mean (null hypothesis expected value).

The procedure computes the test statistic $t$ (t), the degrees of freedom df ( $\nu = n - 1$ ), and the p-value (p).

Syntax

call fsml_ttest_1sample(x, mu0, t, df, p [, h1])

Parameters

x: A rank-1 array of type real.

mu0: A scalar of type real.

h1: An optional argument and character string. If passed, it must be one of the following: "lt", "gt", or "two". If not passed, it will default to "two".

Invalid argument values will result in the return of a sentinel value.

Returns

t: A scalar of the same type as x.

df: A scalar of the same type as x.

p: A scalar of the same type as x.

Paired sample t-test

`fsml_ttest_paired`

Description

The paired sample t-test (or dependent sample t-test) determines if the mean difference between two sample sets are zero. It is mathematically equivalent to the 1-sample t-test conducted on the difference vector $d$ with $\mu_0 = 0$ .

The null hypothesis $H_{0}$ and alternative hypothesis $H_{1}$ can be written as: $H_{0}$ : $\bar{d} = 0$ , and $H_{1}$ : $\bar{d} \neq 0$

The optional argument h1 lets you specify your alternative hypothesis further as "greater than" (h1 = "gt"), "less than" (h1 = "lt"), or "two-sided" (h1 = "two").

The test statstic $t$ is calculated as follows: $t = \frac{\bar{d} - 0}{s_d / \sqrt{n}}$ where $\bar{d}$ is the mean of the differences between the sample sets (x1 and x2), $s_d$ is the standard deviation of the differences, and $n$ is the number of paired samples.

The procedure computes the test statistic $t$ (t), the degrees of freedom df ( $\nu = n - 1$ ), and the p-value (p).

Syntax

call fsml_ttest_paired(x1, x2, t, df, p [, h1])

Parameters

x1: A rank-1 array of type real. It must be the same size as x2.

x2: A rank-1 array of type real. It must be the same size as x1.

h1: An optional argument and character string. If passed, it must be one of the following: "lt", "gt", or "two". If not passed, it will default to "two".

Invalid argument values will result in the return of a sentinel value.

Returns

t: A scalar of the same type as x.

df: A scalar of the same type as x.

p: A scalar of the same type as x.

Pooled and Welch's t-test (2 sample)

`fsml_ttest_2sample`

Description

The 2-sample t-test determines if two population means $\mu_1$ and $\mu_2$ are the same. The procedure handles 2-sample t-tests for equal variances and Welch's t-tests for unequal variances.

The null hypothesis $H_{0}$ and alternative hypothesis $H_{1}$ can be written as: $H_{0}$ : $\mu_1 \ = \mu_2$ , and $H_{1}$ : $\mu_1 \ \neq \mu_2$

The optional argument h1 lets you specify your alternative hypothesis further as "greater than" (h1 = "gt"), "less than" (h1 = "lt"), or "two-sided" (h1 = "two").

The procedure computes the test statistic $t$ (t), the degrees of freedom $\nu$ (df), and the p-value (p).

The procedure defaults to Welch's t-test for unequal variances (eq_var = .false.) if not specified differently. In this case, the test statstic $t$ (t) is calculated as follows: $t = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{\frac{s^2_1}{n_1} + \frac{s^2_2}{n_2} }}$ where $\bar{x}_1$ and $\bar{x}_2$ are the sample means $s^2_1$ and $s^2_2$ are the sample standard deviations, and $n_1$ and $n_1$ are the sample sizes.

The degrees of freedom $\nu$ (df) is approximated with the Welch–Satterthwaite equation: $\nu = \frac{\left( \frac{s_1^2}{n_1} + \frac{s_2^2}{n_2} \right)^2} {\frac{\left( \frac{s_1^2}{n_1} \right)^2}{n_1 - 1} + \frac{\left( \frac{s_2^2}{n_2} \right)^2}{n_2 - 1}}$

If variances are assumed to be equal (eq_var = .true.), the procedure conducts a 2 sample t-test for equal variances, using the pooled standard deviation $s_p$ to calculate the t-statistic: $t = \frac{\bar{x}_1 - \bar{x}_2}{s_p \sqrt{\frac{1}{n_1} + \frac{1}{n_2}}}$

In case of assumed equal variances, the degrees of freedom (df) is calculated as follows: $\nu = n_1 + n_2 - 2$

Syntax

call fsml_ttest_2sample(x1, x2, t, df, p [, eq_var, h1])

Parameters

x1: A rank-1 array of type real.

x2: A rank-1 array of type real.

eq_var: An optional argument of type logical. If not passed, it will default to false.

h1: An optional argument and character string. If passed, it must be one of the following: "lt", "gt", or "two". If not passed, it will default to "two".

Invalid argument values will result in the return of a sentinel value.

Returns

t: A scalar of the same type as x.

df: A scalar of the same type as x.

p: A scalar of the same type as x.

Analysis of variance (one way)

`fsml_anova_1way`

Description

The one-way ANOVA (Analysis of Variance) tests whether three or more population means $\mu_1, \mu_2, \dots, \mu_k$ are equal.

The null hypothesis $H_0$ and alternative hypothesis $H_1$ are defined as: $H_0$ : $\mu_1 = \mu_2 = \cdots = \mu_k$ , and $H_1$ : At least one $\mu_j$ differs from the others.

The data is passed to the procedure as a rank-2 array x, where each column is a group of observations. The procedure partitions the total variability in the data ( $SS_{total}$ ) into variability between groups ( $SS_{between}$ ; variability explained by groups ), and variability within groups ( $SS_{within}$ ; unexplained or residual variability ), so that $SS_{total} = SS_{between} + SS_{within}$

The F-statistic (f) is the ratio of the mean sum of squares between groups to the mean sum of squares within groups: $F = \frac{SS_{between} / (k - 1)}{SS_{within} / (n - k)}$ where $k$ is the number of groups, $n$ is the total number of observations, $SS_{between}$ is the sum of squares between groups, and $SS_{within}$ is the sum of squares within groups.

The degrees of freedom are $\nu_1 = k - 1$ between groups (df_b) and $\nu_2 = n - k$ within groups (df_w).

The resulting p-value (p) is computed from the F-distribution: $p = P(F_{\nu_1, \nu_2} > F_{observed}) = 1 - \text{CDF}(F_{observed})$

The ANOVA makes the assumptions that a) the groups are independent, b) the observations within each group are normally distributed, and c) The variances within groups are equal.

Syntax

call fsml_anova_1way(x, f, df_b, df_w, p)

Parameters

x: A rank-2 array of type real. It must have at least 2 rows and 2 columns.

Invalid argument values will result in the return of a sentinel value.

Returns

f: A scalar of the same type as x.

df_b: A scalar of the same type as x.

df_w: A scalar of the same type as x.

p: A scalar of the same type as x.

Wilcoxon signed-rank (1 sample)

`fsml_signedrank_1sample`

Description

The 1-sample Wilcoxon signed rank test is a non-parametric test that determines if data (x) comes from a symmetric population with centre $mu_0$ (mu0). It can be regarded as a non-parametric version of the 1-sample t-test.

If the data consists of independent and similarly distributed samples from distribution $D$ , the null hypothesis $H_0$ can be expressed as:

$D$ is symmetric around $\mu = \mu_0$ .

The default alternative hypothesis $H_1$ is two-sided and also be set explicitly (h1 = "two"). It can be expressed as:

$D$ is symmetric around $\mu \neq \mu_0$

If the alternative hypothesis is set to "greater than" (h1 = "gt"), it is:

$D$ is symmetric around $\mu > \mu_0$

If the alternative hypothesis is set to "less than" (h1 = "lt"), it is:

$D$ is symmetric around $\mu < \mu_0$

The test statistic $W$ (w) is the smaller of the sum of positive and negative signed ranks: $W = \min \left( \sum_{d_i > 0} R_i, \sum_{d_i < 0} R_i \right)$

The procedure computes the W statistic (w) and the p-value (p).

The procedure takes into consideration tied ranks.

Syntax

call fsml_signedrank_1sample(x, mu0, w, p [,h1])

Parameters

x: A rank-1 array of type real.

mu0: A scalar of type real.

h1: An optional argument and character string. If passed, it must be one of the following: "lt", "gt", or "two". If not passed, it will default to "two".

Invalid argument values will result in the return of a sentinel value.

Returns

w: A scalar of the same type as x.

p: A scalar of the same type as x.

Wilcoxon signed-rank (paired)

`fsml_signedrank_paired`

Description

The Wilcoxon signed rank test is a non-parametric test that determines if two related paired samples $x_1$ and $x_2$ (x1 and x2) come from the same distribution. It can be regarded as a non-parametric version of the paired t-test.

The Wilcoxon signed rank test is mathematically equivalent to the 1-sample Wilcoxon signed rank test conducted on the difference vector $d = x_1 - x_2$ with $mu_0$ set to zero. Consequently, the the null hypothesis $H_0$ can be expressed as:

Samples $x_1 - x_2$ are symmetric around $\mu = 0$ .

The default alternative hypothesis $H_1$ is two-sided and also be set explicitly (h1 = "two"). It can be expressed as:

Samples $x_1 - x_2$ are symmetric around $\mu \neq 0$

If the alternative hypothesis is set to "greater than" (h1 = "gt"), it is:

Samples $x_1 - x_2$ are symmetric around $\mu > 0$

If the alternative hypothesis is set to "less than" (h1 = "lt"), it is:

Samples $x_1 - x_2$ are symmetric around $\mu < 0$

The test statistic $W$ (w) is the smaller of the sum of positive and negative signed ranks: $W = \min \left( \sum_{d_i > 0} R_i, \sum_{d_i < 0} R_i \right)$

The procedure computes the W statistic (w) and the p-value (p).

The procedure takes into consideration tied ranks.

Syntax

call fsml_signedrank_paired(x1, x2, w, p [,h1])

Parameters

x1: A rank-1 array of type real. It must be the same size as x2.

x2: A rank-1 array of type real. It must be the same size as x1.

h1: An optional argument and character string. If passed, it must be one of the following: "lt", "gt", or "two". If not passed, it will default to "two".

Invalid argument values will result in the return of a sentinel value.

Returns

w: A scalar of the same type as x.

p: A scalar of the same type as x.

Mann–Whitney U rank-sum (2 sample)

`fsml_ranksum`

Description

The ranks sum test (Wilcoxon rank-sum test or Mann–Whitney U test) is a non-parametric test to determine if two independent samples $x_1$ and $x_2$ (x1 and x2) are have the same distribution. It is the non-parametric equivalent of the 2-sample t-test.

The null hypothesis $H_{0}$ and alternative hypothesis $H_{1}$ can be written as: $H_0$ : the distributions of $x_1$ and $x_2$ are equal. $H_1$ : the distributions of $x_1$ and $x_2$ are not equal.

The Mann–Whitney U statistic is calculated for each sample as follows: $U_i = R_i - \frac{n_i \cdot (n_i + 1)}{2}$ where $R_i$ is the sum of ranks of sample set $i$ and $n_i$ is the sample size of sample set $i$ . The final U statistic is: $U = \min(U_1, U_2)$

The procedure computes the U statistic (u) and the p-value (p).

The procedure takes into consideration tied ranks.

Syntax

call fsml_ranksum(x1, x2, u, p [,h1])

Parameters

x1: A rank-1 array of type real.

x2: A rank-1 array of type real.

h1: An optional argument and character string. If passed, it must be one of the following: "lt", "gt", or "two". If not passed, it will default to "two".

Invalid argument values will result in the return of a sentinel value.

Returns

u: A scalar of the same type as x.

p: A scalar of the same type as x.

Kruskall Wallis H test

`fsml_kruskalwallis`

Description

The Kruskal-Wallis H-test is used to determine whether samples originate from the same distribution without assuming normality. It is therefore considered a nonparametric alternative to the one-way ANOVA (Analysis of Variance).

The null hypothesis $H_0$ and alternative hypothesis $H_1$ are defined as: $H_0$ : The populations have the same distribution (medians are equal), and $H_1$ : At least one population differs from the others.

The data is passed to the procedure as a rank-2 array x, where each column is a group of observations. All values are ranked across the entire dataset, with tied values assigned the average rank.

The Kruskal-Wallis H-statistic (h) is computed as: $H = \frac{12}{n(n+1)} \sum_{j=1}^{k} \frac{R_j^2}{n_j} - 3(n+1)$

where: - $n$ is the total number of observations, - $k$ is the number of groups, - $n_j$ is the number of observations in group $j$ , and - $R_j$ is the sum of ranks in group $j$ .

The degrees of freedom are: $\nu = k - 1$ and returned as df.

The p-value (p) is computed from the chi-squared distribution: $p = P(\chi^2_{\nu} > H_{observed}) = 1 - \text{CDF}(H_{observed})$

The Kruskal-Wallis test assumes that: a) all groups are independent, b) the response variable is ordinal or continuous, c) the group distributions have the same shape, and d) observations are independent both within and between groups.

Syntax

call fsml_kruskalwallis(x, h, df, p)

Parameters

x: A rank-2 array of type real. It must have at least 2 rows and 2 columns.

Invalid argument values will result in the return of a sentinel value.

Returns

h: A scalar of the same type as x.

df: A scalar of the same type as x.

p: A scalar of the same type as x.

Examples

program example_tst

! |--------------------------------------------------------------------|
! | fsml - fortran statistics and machine learning library             |
! |                                                                    |
! | about                                                              |
! | -----                                                              |
! | Examples for statistical tests (tst module).                       |
! |                                                                    |
! | license : MIT                                                      |
! | author  : Sebastian G. Mutz (sebastian@sebastianmutz.com)          |
! |--------------------------------------------------------------------|

  use :: fsml
  use :: fsml_ini ! import wp; alternatively: iso_fortran_env, wp => real64

  implicit none

! ==== Description
!! The programme demonstrates the use of all statistical tests based on
!! the same few data vectors.

! ==== Declarations
  integer(i4), parameter :: n1 = 10, n2 = 13
  real(wp)   , parameter :: mu = 1.25_wp
  real(wp)   , parameter :: x1(n1) =        &
                                  [1.10_wp, &
                                   1.02_wp, &
                                   1.12_wp, &
                                   2.01_wp, &
                                   1.92_wp, &
                                   1.01_wp, &
                                   1.10_wp, &
                                   1.26_wp, &
                                   1.51_wp, &
                                   1.01_wp]
  real(wp)   , parameter :: x2(n2) =        &
                                  [2.10_wp, &
                                   1.02_wp, &
                                   1.22_wp, &
                                   1.05_wp, &
                                   0.95_wp, &
                                   1.02_wp, &
                                   2.00_wp, &
                                   1.05_wp, &
                                   1.12_wp, &
                                   1.20_wp, &
                                   1.12_wp, &
                                   1.01_wp, &
                                   1.12_wp]
  real(wp)   , parameter :: x2d(5,3) = reshape([ &
                                     & 2.1_wp, 2.5_wp, 1.9_wp, 2.3_wp, 2.0_wp, & ! Group 1
                                     & 2.4_wp, 2.8_wp, 2.6_wp, 3.2_wp, 2.9_wp, & ! Group 2
                                     & 2.0_wp, 2.8_wp, 2.1_wp, 2.3_wp, 2.9_wp  & ! Group 3
                                     & ], shape=[5,3])
  real(wp)               :: t, p, df, df1, df2

! ==== Instructions

! ---- Parametric Tests

  print*
  write(*,'(A)') " Parametric Tests"
  write(*,'(A)') " ================"
  print*

  ! 1-sample t-test
  call fsml_ttest_1sample(x1, mu, t, df, p, h1="two")
  write(*,'(A)') "> 1-sample student's t-test"
  write(*,'(A,10F10.4)') "  samples:            ", x1
  write(*,'(A,F10.4)')   "  test statistic (t): ", t
  write(*,'(A,F10.4)')   "  degrees of freedom: ", df
  write(*,'(A,F10.4)')   "  p-value:            ", p
  print*
  ! test statistic (t):     0.4672
  ! degrees of freedom:     9.0000
  ! p-value:                0.6514


  ! paired t-test
  call fsml_ttest_paired(x1, x2(:n1), t, df, p, h1="two")
  write(*,'(A)') "> 2-sample paired t-test"
  write(*,'(A,10F10.4)') "  samples 1:          ", x1
  write(*,'(A,10F10.4)') "  samples 2:          ", x2(:n1)
  write(*,'(A,F10.4)')   "  test statistic (t): ", t
  write(*,'(A,F10.4)')   "  degrees of freedom: ", df
  write(*,'(A,F10.4)')   "  p-value:            ", p
  print*
  ! test statistic (t):     0.1584
  ! degrees of freedom:     9.0000
  ! p-value:                0.8776


  ! 2-sample pooled t-test (assume equal variances)
  call fsml_ttest_2sample(x1, x2, t, df, p, eq_var=.true., h1="two")
  write(*,'(A)') "> 2-sample pooled t-test"
  write(*,'(A,10F10.4)') "  samples 1:          ", x1
  write(*,'(A,13F10.4)') "  samples 2:          ", x2
  write(*,'(A,F10.4)')   "  test statistic (t): ", t
  write(*,'(A,F10.4)')   "  degrees of freedom: ", df
  write(*,'(A,F10.4)')   "  p-value:            ", p
  print*
  ! test statistic (t):     0.4862
  ! degrees of freedom:    21.0000
  ! p-value:                0.6319


  ! 2-sample t-test for unequal variances (Welch's t-test)
  call fsml_ttest_2sample(x1, x2, t, df, p, eq_var=.false., h1="two")
  write(*,'(A)') "> 2-sample Welch's t-test"
  write(*,'(A,10F10.4)') "  samples 1:          ", x1
  write(*,'(A,13F10.4)') "  samples 2:          ", x2
  write(*,'(A,F10.4)')   "  test statistic (t): ", t
  write(*,'(A,F10.4)')   "  degrees of freedom: ", df
  write(*,'(A,F10.4)')   "  p-value:            ", p
  print*
  ! test statistic (t):     0.4850
  ! degrees of freedom:    19.3423
  ! p-value:                0.6331

  ! 1-way ANOVA
  call fsml_anova_1way(x2d, t, df1, df2, p)
  write(*,'(A)') "> 1-way ANOVA"
  write(*,'(A,5F10.4)') "  group 1:            ", x2d(:, 1)
  write(*,'(A,5F10.4)') "  group 2:            ", x2d(:, 2)
  write(*,'(A,5F10.4)') "  group 3:            ", x2d(:, 3)
  write(*,'(A,F10.4)')  "  test statistic (F): ", t
  write(*,'(A,F10.4)')  "  df between:         ", df1
  write(*,'(A,F10.4)')  "  df within:          ", df2
  write(*,'(A,F10.4)')  "  p-value:            ", p
  ! test statistic (F):     4.5868
  ! df between:             2.0000
  ! df within:             12.0000
  ! p-value:                0.0331


! ---- Non-Parametric Tests

  print*
  write(*,'(A)') " Non-Parametric Tests"
  write(*,'(A)') " ===================="
  print*

  ! 1-sample Wilcoxon signed rank test
  call fsml_signedrank_1sample(x1, mu, t, p, h1="two")
  write(*,'(A)') "> 1-sample Wilcoxon signed rank test"
  write(*,'(A,10F10.4)') "  samples:            ", x1
  write(*,'(A,F10.4)')   "  test statistic (w): ", t
  write(*,'(A,F10.4)')   "  p-value:            ", p
  print*
  ! test statistic (w):    27.0000
  ! p-value:                0.9594

  ! 2-sample (paired sample) Wilcoxon signed rank test
  call fsml_signedrank_paired(x1, x2(:n1), t, p, h1="two")
  write(*,'(A)') "> paired sample Wilcoxon signed rank test"
  write(*,'(A,10F10.4)') "  samples 1:          ", x1
  write(*,'(A,10F10.4)') "  samples 2:          ", x2(:n1)
  write(*,'(A,F10.4)')   "  test statistic (w): ", t
  write(*,'(A,F10.4)')   "  p-value:            ", p
  print*
  ! test statistic (w):    21.0000
  ! p-value:                0.8590


  ! 2-sample Wilcoxon Mann-Whitney U rank sum test
  call fsml_ranksum(x1, x2, t, p, h1="two")
  write(*,'(A)') "> Mann-Whitney U rank sum test"
  write(*,'(A,10F10.4)') "  samples 1:          ", x1
  write(*,'(A,13F10.4)') "  samples 2:          ", x2
  write(*,'(A,F10.4)')   "  test statistic (U): ", t
  write(*,'(A,F10.4)')   "  p-value:            ", p
  print*
  ! test statistic (U):    59.5000
  ! p-value:                0.7330


  ! Kruskal Wallis H test
  call fsml_kruskalwallis(x2d, t, df, p)
  write(*,'(A)') "> Kruskal Wallis H test"
  write(*,'(A,5F10.4)') "  group 1:            ", x2d(:, 1)
  write(*,'(A,5F10.4)') "  group 2:            ", x2d(:, 2)
  write(*,'(A,5F10.4)') "  group 3:            ", x2d(:, 3)
  write(*,'(A,F10.4)')  "  test statistic (H): ", t
  write(*,'(A,F10.4)')  "  degrees of freedom  ", df
  write(*,'(A,F10.4)')  "  p-value:            ", p
  ! test statistic (H):     5.9850
  ! degrees of freedom      2.0000
  ! p-value:                0.0502

end program example_tst

TST: Statistical Tests

Overview

Student t-test (1 sample)

fsml_ttest_1sample

Description

Syntax

Parameters

Returns

Paired sample t-test

fsml_ttest_paired

Description

Syntax

Parameters

Returns

Pooled and Welch's t-test (2 sample)

fsml_ttest_2sample

Description

Syntax

Parameters

Returns

Analysis of variance (one way)

fsml_anova_1way

Description

Syntax

Parameters

Returns

Wilcoxon signed-rank (1 sample)

fsml_signedrank_1sample

Description

Syntax

Parameters

Returns

Wilcoxon signed-rank (paired)

fsml_signedrank_paired

Description

Syntax

Parameters

Returns

Mann–Whitney U rank-sum (2 sample)

fsml_ranksum

Description

Syntax

Parameters

Returns

Kruskall Wallis H test

fsml_kruskalwallis

Description

Syntax

Parameters

Returns

Examples

`fsml_ttest_1sample`

`fsml_ttest_paired`

`fsml_ttest_2sample`

`fsml_anova_1way`

`fsml_signedrank_1sample`

`fsml_signedrank_paired`

`fsml_ranksum`

`fsml_kruskalwallis`