Section 3.4 — Hypothesis testing using analytical approximations#

This notebook contains the code examples from Section 3.4 Hypothesis testing using analytical approximations of the No Bullshit Guide to Statistics.

Notebook setup#

# load Python modules
import os
import numpy as np
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
# Plot helper functions
from ministats import nicebins
from ministats import plot_pdf
from ministats.utils import savefigure
# Figures setup
plt.clf()  # needed otherwise `sns.set_theme` doesn't work
from plot_helpers import RCPARAMS
# RCPARAMS.update({'figure.figsize': (10, 3)})   # good for screen
RCPARAMS.update({'figure.figsize': (5, 1.6)})  # good for print
sns.set_theme(
    context="paper",
    style="whitegrid",
    palette="colorblind",
    rc=RCPARAMS,
)

# Useful colors
snspal = sns.color_palette()
blue, orange, purple = snspal[0], snspal[1], snspal[4]
# red = sns.color_palette("tab10")[3]

# High-resolution please
%config InlineBackend.figure_format = 'retina'

# Where to store figures
DESTDIR = "figures/stats/intro_to_NHST"
# set random seed for repeatability
np.random.seed(42)

\(\def\stderr#1{\mathbf{se}_{#1}}\) \(\def\stderrhat#1{\hat{\mathbf{se}}_{#1}}\) \(\newcommand{\Mean}{\textbf{Mean}}\) \(\newcommand{\Var}{\textbf{Var}}\) \(\newcommand{\Std}{\textbf{Std}}\) \(\newcommand{\Freq}{\textbf{Freq}}\) \(\newcommand{\RelFreq}{\textbf{RelFreq}}\) \(\newcommand{\DMeans}{\textbf{DMeans}}\) \(\newcommand{\Prop}{\textbf{Prop}}\) \(\newcommand{\DProps}{\textbf{DProps}}\)

\[ \newcommand{\CI}[1]{\textbf{CI}_{#1}} \newcommand{\CIL}[1]{\textbf{L}_{#1}} \newcommand{\CIU}[1]{\textbf{U}_{#1}} \newcommand{\ci}[1]{\textbf{ci}_{#1}} \newcommand{\cil}[1]{\textbf{l}_{#1}} \newcommand{\ciu}[1]{\textbf{u}_{#1}} \]

(this cell contains the macro definitions like \(\stderr{\overline{\mathbf{x}}}\), \(\stderrhat{}\), \(\Mean\), …)

Definitions#

Context#

  • test statistic

  • sampling distribution of the test statistic

  • pivotal transformation

  • reference distributions

def mean(sample):
    return sum(sample) / len(sample)

def var(sample):
    xbar = mean(sample)
    sumsqdevs = sum([(xi-xbar)**2 for xi in sample])
    return sumsqdevs / (len(sample)-1)  # n-1 denominator (unbiased estimator)

def std(sample):
    s2 = var(sample)
    return np.sqrt(s2)
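
As a quick sanity check, we can verify these helper functions against NumPy's built-in implementations with ddof=1 (the test values below are arbitrary, for illustration only):

# sanity check: the helpers agree with NumPy's ddof=1 implementations
testsample = [1, 2, 3, 4, 5]  # arbitrary illustrative values
assert mean(testsample) == np.mean(testsample)
assert np.isclose(var(testsample), np.var(testsample, ddof=1))
assert np.isclose(std(testsample), np.std(testsample, ddof=1))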

Analytical approximations#

# theoretical model for the kombucha volumes
muX0 = 1000   # population mean (expected kombucha volume)
sigmaX0 = 10  # population standard deviation
sample = [1005.19,  987.31, 1002.4,  991.96,
          1000.17, 1003.94, 1012.79]
n = len(sample)  # sample size

Normal approximation to the sample mean#

from scipy.stats import norm

sigmaX0 = 10  # population standard deviation
se = sigmaX0 / np.sqrt(n)
rvNXbar_0 = norm(loc=muX0, scale=se)
ax = plot_pdf(rvNXbar_0, xlims=[980, 1020])
ax.set_ylabel("$f_{\overline{\mathbf{X}}_0}$")
ax.set_xlabel("$\overline{\mathbf{x}}$");
[Figure: pdf \(f_{\overline{\mathbf{X}}_0}\) of the normal approximation to the sample mean.]

Student’s \(t\)-approximation to the sample mean#

from scipy.stats import t as tdist

muX0 = 1000      # population mean (expected kombucha volume)
sigmaX0 = 10     # population standard deviation
n = len(sample)  # sample size
sehat = std(sample) / np.sqrt(n)
rvTXbar_0 = tdist(df=n-1, loc=muX0, scale=sehat)
ax = plot_pdf(rvTXbar_0, xlims=[980, 1020])
ax.set_ylabel("$f_{\overline{\mathbf{X}}_0}$")
ax.set_xlabel("$\overline{\mathbf{x}}$");
[Figure: pdf \(f_{\overline{\mathbf{X}}_0}\) of the Student's \(t\)-approximation to the sample mean.]

The \(\chi^2\)-approximation to the sample variance#

from scipy.stats import chi2
sigmaX0 = 10     # population standard deviation
n = len(sample)  # sample size
scale = sigmaX0**2 / (n-1)
rvQS2_0 = chi2(df=n-1, scale=scale)
ax = plot_pdf(rvQS2_0, rv_name="Q")
ax.set_ylabel("$f_{S^2_{\mathbf{X},0}}$")
ax.set_xlabel("$s^2_{\mathbf{x}}$");
[Figure: pdf \(f_{S^2_{\mathbf{X},0}}\) of the \(\chi^2\)-approximation to the sample variance.]

Calculating \(p\)-values#

# FIGURES ONLY
filename = os.path.join(DESTDIR, "panel_p-values_theta_left_twotailed_right_tests.pdf")
    
from scipy.stats import t as tdist
rvT = tdist(df=9)

xs = np.linspace(-4, 4, 1000)
ys = rvT.pdf(xs)

with plt.rc_context({"figure.figsize":(9,2.3)}), sns.axes_style("ticks"):
    fig, (ax3, ax1, ax2) = plt.subplots(1,3)
    ax3.set_ylabel(r"$f_{\widehat{\Theta}_0}$")

    # RIGHT
    title = '(a) right-tailed test'
    ax3.set_title(title, fontsize=13)#, y=-0.26)
    sns.lineplot(x=xs, y=ys, ax=ax3)
    ax3.set_xlim(-4, 4)
    ax3.set_ylim(0, 0.42)
    ax3.set_xticks([0,2])
    ax3.set_xticklabels([])
    ax3.set_yticks([])
    ax3.spines[['right', 'top']].set_visible(False)

    # highlight the right tail
    mask = (xs > 2)
    ax3.fill_between(xs[mask], y1=ys[mask], alpha=0.6, facecolor="red")
    ax3.vlines([2], ymin=0, ymax=rvT.pdf(2), linestyle="-", alpha=0.5, color="red")
    ax3.text(2, -0.03, r"$\hat{\theta}_{\mathbf{x}}$", va="top", ha="center")
    ax3.text(0, -0.04, r"$\theta_0$", va="top", ha="center")


    # LEFT
    title = '(b) left-tailed test'
    ax1.set_title(title, fontsize=13) #, y=-0.26)
    sns.lineplot(x=xs, y=ys, ax=ax1)
    ax1.set_xlim(-4, 4)
    ax1.set_ylim(0, 0.42)
    ax1.set_xticks([-2,0])
    ax1.set_xticklabels([])
    ax1.set_yticks([])
    ax1.spines[['left', 'right', 'top']].set_visible(False)

    # highlight the left tail
    mask = (xs < -2)
    ax1.fill_between(xs[mask], y1=ys[mask], alpha=0.6, facecolor="red")
    ax1.vlines([-2], ymin=0, ymax=rvT.pdf(-2), linestyle="-", alpha=0.5, color="red")
    ax1.text(-2, -0.03, r"$\hat{\theta}_{\mathbf{x}}$", va="top", ha="center")
    ax1.text(0, -0.04, r"$\theta_0$", va="top", ha="center")


    # TWO-TAILED
    title = '(c) two-tailed test'
    ax2.set_title(title, fontsize=13)#, y=-0.26)
    sns.lineplot(x=xs, y=ys, ax=ax2)
    ax2.set_xlim(-4, 4)
    ax2.set_ylim(0, 0.42)
    ax2.set_xticks([-2,0,2])
    ax2.set_xticklabels([])
    ax2.set_yticks([])
    ax2.spines[['left', 'right', 'top']].set_visible(False)

    # highlight the left and right tails
    mask = (xs < -2)
    ax2.fill_between(xs[mask], y1=ys[mask], alpha=0.6, facecolor="red")
    ax2.vlines([-2], ymin=0, ymax=rvT.pdf(-2), linestyle="-", alpha=0.5, color="red")
    # ax2.text(-2, -0.03, r"$-|\hat{\theta}_{\mathbf{x}}|$", va="top", ha="center")
    ax2.text(-2, -0.03, r"$\theta_0$-dev", va="top", ha="center")
    mask = (xs > 2)
    ax2.fill_between(xs[mask], y1=ys[mask], alpha=0.6, facecolor="red")
    ax2.vlines([2], ymin=0, ymax=rvT.pdf(2), linestyle="-", alpha=0.5, color="red")
    ax2.text(2, -0.03, r"$\theta_0$+dev", va="top", ha="center")
    ax2.text(0, -0.04, r"$\theta_0$", va="top", ha="center")

savefigure(fig, filename)
Saved figure to figures/stats/intro_to_NHST/panel_p-values_theta_left_twotailed_right_tests.pdf
Saved figure to figures/stats/intro_to_NHST/panel_p-values_theta_left_twotailed_right_tests.png
[Figure: panels (a) right-tailed, (b) left-tailed, and (c) two-tailed tests, showing the tail areas of \(f_{\widehat{\Theta}_0}\) that determine the \(p\)-value.]

Pivotal transformations#

Calculating \(p\)-values for standardized test statistics#

# FIGURES ONLY
filename = os.path.join(DESTDIR, "panel_p-values_t_left_twotailed_right_tests.pdf")
    
from scipy.stats import t as tdist
rvT = tdist(df=9)

xs = np.linspace(-4, 4, 1000)
ys = rvT.pdf(xs)

with plt.rc_context({"figure.figsize":(9,2.3)}), sns.axes_style("ticks"):
    fig, (ax3, ax1, ax2) = plt.subplots(1,3)
    ax3.set_ylabel("$f_{T_0}$")

    # RIGHT
    title = '(a) right-tailed test'
    ax3.set_title(title, fontsize=13)#, y=-0.26)
    sns.lineplot(x=xs, y=ys, ax=ax3)
    ax3.set_xlim(-4, 4)
    ax3.set_ylim(0, 0.42)
    ax3.set_xticks([0,2])
    ax3.set_xticklabels([])
    ax3.set_yticks([])
    ax3.spines[['right', 'top']].set_visible(False)

    # highlight the right tail
    mask = (xs > 2)
    ax3.fill_between(xs[mask], y1=ys[mask], alpha=0.6, facecolor="red")
    ax3.vlines([2], ymin=0, ymax=rvT.pdf(2), linestyle="-", alpha=0.5, color="red")
    ax3.text(2, -0.03, r"$t_{\mathbf{x}}$", va="top", ha="center")
    ax3.text(0, -0.04, r"$0$", va="top", ha="center")

    # LEFT
    title = '(b) left-tailed test'
    ax1.set_title(title, fontsize=13) #, y=-0.26)
    sns.lineplot(x=xs, y=ys, ax=ax1)
    ax1.set_xlim(-4, 4)
    ax1.set_ylim(0, 0.42)
    ax1.set_xticks([-2,0])
    ax1.set_xticklabels([])
    ax1.set_yticks([])
    ax1.spines[['left', 'right', 'top']].set_visible(False)

    # highlight the left tail
    mask = (xs < -2)
    ax1.fill_between(xs[mask], y1=ys[mask], alpha=0.6, facecolor="red")
    ax1.vlines([-2], ymin=0, ymax=rvT.pdf(-2), linestyle="-", alpha=0.5, color="red")
    ax1.text(-2, -0.03, r"$t_{\mathbf{x}}$", va="top", ha="center")
    ax1.text(0, -0.04, r"$0$", va="top", ha="center")


    # TWO-TAILED
    title = '(c) two-tailed test'
    ax2.set_title(title, fontsize=13)#, y=-0.26)
    sns.lineplot(x=xs, y=ys, ax=ax2)
    ax2.set_xlim(-4, 4)
    ax2.set_ylim(0, 0.42)
    ax2.set_xticks([-2,0,2])
    ax2.set_xticklabels([])
    ax2.set_yticks([])
    ax2.spines[['left', 'right', 'top']].set_visible(False)

    # highlight the left and right tails
    mask = (xs < -2)
    ax2.fill_between(xs[mask], y1=ys[mask], alpha=0.6, facecolor="red")
    ax2.vlines([-2], ymin=0, ymax=rvT.pdf(-2), linestyle="-", alpha=0.5, color="red")
    ax2.text(-2, -0.03, r"$-|t_{\mathbf{x}}|$", va="top", ha="center")
    mask = (xs > 2)
    ax2.fill_between(xs[mask], y1=ys[mask], alpha=0.6, facecolor="red")
    ax2.vlines([2], ymin=0, ymax=rvT.pdf(2), linestyle="-", alpha=0.5, color="red")
    ax2.text(2, -0.03, r"$|t_{\mathbf{x}}|$", va="top", ha="center")
    ax2.text(0, -0.04, r"$0$", va="top", ha="center")


savefigure(fig, filename)
Saved figure to figures/stats/intro_to_NHST/panel_p-values_t_left_twotailed_right_tests.pdf
Saved figure to figures/stats/intro_to_NHST/panel_p-values_t_left_twotailed_right_tests.png
[Figure: panels (a) right-tailed, (b) left-tailed, and (c) two-tailed tests, showing the tail areas of \(f_{T_0}\) beyond the observed statistic \(t_{\mathbf{x}}\).]

Test for the mean (known variance)#

Analytical approximation for the sample mean#

If the theoretical distribution under the null hypothesis is normal, \(X_0 \sim \mathcal{N}(\mu_{X_0}, \sigma_{X_0})\), then the central limit theorem tells us that the sampling distribution of the mean for samples of size \(n\) is described by the formula

\[ \overline{\mathbf{X}}_0 \sim \mathcal{N}(\mu_{X_0}, \frac{\sigma_{X_0}}{\sqrt{n}}). \]
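
A quick way to verify this formula is a small simulation (a sketch, reusing the values muX0, sigmaX0, and n = 7 defined above; the seed is an arbitrary choice): generate many samples from the theoretical model and compare the standard deviation of the sample means to the predicted standard error \(\sigma_{X_0}/\sqrt{n}\).

from scipy.stats import norm

# simulation sketch: 10,000 samples of size n from the theoretical model;
# the std of the sample means should approximate sigmaX0 / sqrt(n)
rvX0 = norm(loc=muX0, scale=sigmaX0)
xbars = rvX0.rvs(size=(10_000, n), random_state=43).mean(axis=1)
np.std(xbars), sigmaX0 / np.sqrt(n)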

Kombucha bottling process#

We’ll use the kombucha scenario for all the examples in this section. Recall that the probability distribution of the kombucha volume is described by the theoretical model \(K_0 \sim \mathcal{N}(\mu_{K_0} = 1000, \sigma_{K_0}=10)\) when the production line is working as expected.

We can use the central limit theorem to obtain the sampling distribution of the mean, since the parameters \(\mu_{K_0}\) and \(\sigma_{K_0}\) are known:

\[ \overline{\mathbf{K}}_0 \sim \mathcal{N}(\mu_{K_0}, \frac{\sigma_{K_0}}{\sqrt{n}}). \]
# parameters of the theoretical model for the kombucha volumes
muK0 = 1000   # population mean (expected kombucha volume)
sigmaK0 = 10  # population standard deviation

Example 1N: test for the mean of Batch 04#

kombucha = pd.read_csv("../datasets/kombucha.csv")
batch04 = kombucha[kombucha["batch"]==4]
ksample04 = batch04["volume"]
# sample size
n04 = len(ksample04)
n04
40
# observed mean
obsmean04 = mean(ksample04)
obsmean04
1003.8335
# standard error of the mean
se04 = sigmaK0 / np.sqrt(n04)
se04
1.5811388300841895

We’ll perform \(p\)-value calculations based on the standardized test statistic

\[ z_{\mathbf{k}} = \frac{ \overline{\mathbf{k}} - \mu_{K_0} }{ \stderr{\overline{\mathbf{k}},0} }. \]

# compute the z statistic 
obsz04 = (obsmean04 - muK0) / se04
obsz04
2.42451828205107

The reference distribution is the standard normal distribution \(Z_0 \sim \mathcal{N}(\tt{loc}=0, \; \tt{scale}=1)\).

We can use the cumulative distribution function \(F_{Z_0}\) to compute the two tail probabilities.

from scipy.stats import norm
rvZ0 = norm(loc=0, scale=1)
#          left tail          +  right tail 
pvalue04 = rvZ0.cdf(-obsz04)  +  (1 - rvZ0.cdf(obsz04))
pvalue04
0.015328711497996476
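
Since the standard normal distribution is symmetric about zero, an equivalent shortcut is to double the right-tail probability, which gives the same answer up to floating-point rounding:

# ALT. exploit the symmetry of the standard normal distribution
2 * (1 - rvZ0.cdf(abs(obsz04)))
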
filename = os.path.join(DESTDIR, "p-value_tails_kombucha_obsz04.pdf")
from ministats import calc_prob_and_plot_tails
_, ax = calc_prob_and_plot_tails(rvZ0, -obsz04, obsz04, xlims=[-4,4])
ax.set_title(None)
ax.set_xlabel("$z$")
ax.set_ylabel("$f_{Z_0}$")
savefigure(ax, filename)
Saved figure to figures/stats/intro_to_NHST/p-value_tails_kombucha_obsz04.pdf
Saved figure to figures/stats/intro_to_NHST/p-value_tails_kombucha_obsz04.png
[Figure: the two-tailed \(p\)-value as the shaded areas of \(f_{Z_0}\) beyond ±obsz04.]

Alternative calculation without pivotal transformation#

Note that the pivotal transformation to the standard normal \(Z_0\) is not necessary. We could have obtained the same \(p\)-value directly from the sampling distribution of the mean, which is described by a non-standard normal distribution:

\[ \overline{\mathbf{K}}_0 \sim \mathcal{N}(\texttt{loc}=\mu_{K_0},\; \texttt{scale}=\stderr{\overline{\mathbf{k}}_{04},0}) \]
from scipy.stats import norm
rvKbar0 = norm(loc=muK0, scale=se04)
#######################################################
dev = abs(obsmean04 - muK0)
dev
3.833499999999958
# left tail            +  right tail 
rvKbar0.cdf(muK0-dev)  +  (1 - rvKbar0.cdf(muK0+dev))
0.015328711497996476
filename = os.path.join(DESTDIR, "p-value_tails_Example1N_Kbar0.pdf")
from ministats import calc_prob_and_plot_tails
dev = abs(obsmean04-muK0)
_, ax = calc_prob_and_plot_tails(rvKbar0, muK0-dev, muK0+dev)
ax.set_title(None)
ax.set_xlabel("$\overline{\mathbf{k}}$")
ax.set_ylabel("$f_{\overline{\mathbf{K}}_0}$")
savefigure(ax, filename)
Saved figure to figures/stats/intro_to_NHST/p-value_tails_Example1N_Kbar0.pdf
Saved figure to figures/stats/intro_to_NHST/p-value_tails_Example1N_Kbar0.png
[Figure: the same two-tailed \(p\)-value computed directly from \(f_{\overline{\mathbf{K}}_0}\).]

The \(p\)-value we obtain is exactly the same.

Example 2N: test for the mean of Batch 01#

kombucha = pd.read_csv("../datasets/kombucha.csv")
ksample01 = kombucha[kombucha["batch"]==1]["volume"]
n01 = len(ksample01)
n01
40
# observed mean
obsmean01 = mean(ksample01)
obsmean01
999.10375
# standard error of the mean
se01 = sigmaK0 / np.sqrt(n01)
se01
1.5811388300841895

Note the standard error is the same as se04 we calculated in Example 1N. This is because the sample size is the same, and we’re relying on the same assumptions about the standard deviation of the theoretical distribution.

# compute the z statistic 
obsz01 = (obsmean01 - muK0) / se01
obsz01
-0.5668382705851878
from scipy.stats import norm

rvZ0 = norm(loc=0, scale=1)
#          left tail               +  right tail 
pvalue01 = rvZ0.cdf(-abs(obsz01))  +  (1 - rvZ0.cdf(abs(obsz01)))
pvalue01
0.5708240666473916

The \(p\)-value is large, so there is no reason to reject \(H_0\): the data from Batch 01 are consistent with a regular batch.

filename = os.path.join(DESTDIR, "p-value_tails_kombucha_obsz01.pdf")
from ministats import calc_prob_and_plot_tails
_, ax = calc_prob_and_plot_tails(rvZ0, obsz01, -obsz01, xlims=[-4,4])
ax.set_title(None)
ax.set_xlabel("$z$")
ax.set_ylabel("$f_{Z_0}$")
savefigure(ax, filename)
Saved figure to figures/stats/intro_to_NHST/p-value_tails_kombucha_obsz01.pdf
Saved figure to figures/stats/intro_to_NHST/p-value_tails_kombucha_obsz01.png
[Figure: the two-tailed \(p\)-value as the shaded areas of \(f_{Z_0}\) beyond ±obsz01.]

Test for the mean (unknown variance)#

Analytical approximation based on Student’s \(t\)-distribution#

Consider again a theoretical distribution \(X_0 \sim \mathcal{N}(\mu_{X_0}, \sigma_{X_0})\), but this time assume that \(\sigma_{X_0}\) is not known.

The sampling distribution of the mean \(\overline{\mathbf{X}}_0\) for samples of size \(n\), after applying the location-scale transformation, can be modelled in terms of the standard \(t\)-distribution with \(n-1\) degrees of freedom:

\[ T_0 = \frac{ \overline{\mathbf{X}}_0-\mu_{X_0} }{ \stderrhat{\overline{\mathbf{x}}} } \;\; \sim \;\; \mathcal{T}(\tt{df}=n-1, \; \tt{loc}=0, \; \tt{scale}=1). \]
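
The \(t\)-distribution has heavier tails than the standard normal, which matters most for small samples. As a quick illustration (the choice df=6 is arbitrary, matching a sample of size \(n=7\)), we can compare the 97.5th percentiles of the two reference distributions:

from scipy.stats import norm, t as tdist

# the t quantile sits farther from zero than the normal quantile,
# so t-based p-values are larger for the same observed statistic
tdist(df=6).ppf(0.975), norm().ppf(0.975)  # approx. 2.45 vs 1.96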

Example 1T: test for the mean of Batch 04#

Assume we know the mean, but not the variance#

kombucha = pd.read_csv("../datasets/kombucha.csv")
ksample04 = kombucha[kombucha["batch"]==4]["volume"]
n04 = len(ksample04)
n04
40
# estimated standard error of the mean
s04 = std(ksample04)
s04
7.85217413955834
sehat04 = s04 / np.sqrt(n04)
sehat04
1.2415377432638601

Recall the value se04 = \(\stderr{\overline{\mathbf{k}}_{04},0}=\frac{\sigma_{K_0}}{\sqrt{40}} = 1.58\), which we obtained by assuming the population standard deviation is known. We see that sehat04 is an underestimate, because the sample standard deviation s04 happens to be smaller than the true population standard deviation \(\sigma_{K_0} = 10\).

# observed sample mean
obsmean04 = mean(ksample04)
obsmean04
1003.8335
# compute the t statistic 
obst04 = (obsmean04 - muK0) / sehat04
obst04
3.087703149420272
from scipy.stats import t as tdist

df04 = n04 - 1  # n-1 degrees of freedom
rvT04 = tdist(df=df04)

#           left tail          +  right tail 
pvalue04t = rvT04.cdf(-obst04) +  (1-rvT04.cdf(obst04))
pvalue04t
0.00370566535033293
# no figure: the tail probabilities are too small to see on a plot

Effect size estimates#

We can estimate the effect size using the formula \(\widehat{\Delta} = \overline{\mathbf{k}}_{04} - \mu_{K_0}\).

obsmean04 - muK0
3.833499999999958

We can also obtain a confidence interval for the effect size \(\ci{\Delta,0.9}\) by first computing a 90% confidence interval for the population mean \(\ci{\mu,0.9}\), then subtracting the theoretical mean \(\mu_{K_0}\).

from ministats import ci_mean
cimu04 = ci_mean(ksample04, alpha=0.1, method="a")
cimu04
[1001.7416639464577, 1005.9253360535422]
[cimu04[0]-muK0, cimu04[1]-muK0]
[1.7416639464577202, 5.925336053542196]

Example 2T: test for the mean of Batch 01#

kombucha = pd.read_csv("../datasets/kombucha.csv")
ksample01 = kombucha[kombucha["batch"]==1]["volume"]
# estimated standard error of the mean
s01 = std(ksample01)
n01 = len(ksample01)
sehat01 = s01 / np.sqrt(n01)
sehat01
1.5446402654597249

Compare this with se01 = \(\stderr{\overline{\mathbf{k}}_{01},0}=\frac{\sigma_{K_0}}{\sqrt{40}} = 1.58\), the standard error we computed in Example 2N under the known-variance assumption.

# observed sample mean
obsmean01 = mean(ksample01)
obsmean01
999.10375
# compute the t statistic 
obst01 = (obsmean01 - muK0) / sehat01
obst01
-0.5802321874169595
from scipy.stats import t as tdist

df01 = n01 - 1  # n-1 degrees of freedom
rvT01 = tdist(df=df01)
#######################################################
#           left tail          +  right tail 
pvalue01t = rvT01.cdf(-abs(obst01)) + 1-rvT01.cdf(abs(obst01))
pvalue01t
0.5650956637295477
# # ALT. compute right tail and double the value
# p_right = 1 - rvT01.cdf(abs(obst01))
# pvalue01t = 2 * p_right 
# pvalue01t
filename = os.path.join(DESTDIR, "p-value_tails_kombucha_obst01.pdf")
from ministats import calc_prob_and_plot_tails
_, ax = calc_prob_and_plot_tails(rvT01, obst01, -obst01, xlims=[-4,4])
ax.set_title(None)
ax.set_xlabel("$t$")
ax.set_ylabel("$f_{T_0}$")
savefigure(ax, filename)
Saved figure to figures/stats/intro_to_NHST/p-value_tails_kombucha_obst01.pdf
Saved figure to figures/stats/intro_to_NHST/p-value_tails_kombucha_obst01.png
[Figure: the two-tailed \(p\)-value as the shaded areas of \(f_{T_0}\) beyond ±obst01.]

Test for the variance#

Formula for sampling distribution of the variance#

Chi-square test for variance#

If the theoretical distribution under the null hypothesis is normal, \(K_0 \sim \mathcal{N}(\mu_{K_0}, \sigma_{K_0})\), then the sampling distribution of the variance for samples of size \(n\) is described by a scaled chi-square distribution:

\[ S_{\mathbf{K},0}^2 \;\; \sim \;\; \chi^2(\tt{df}\!=\!n-1, \; \tt{scale}\!=\!\tfrac{ \sigma_{K_0}^2 }{ n-1 } ) \; = \; \tfrac{ \sigma_{K_0}^2 }{ n-1 } \cdot \chi^2(n-1). \]
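
Before the examples, a quick sanity check on this formula (a simulation sketch, not part of the original examples): the mean of the scaled chi-square model is \(\tfrac{\sigma_{K_0}^2}{n-1} \cdot (n-1) = \sigma_{K_0}^2\), so the average of many simulated sample variances should be close to \(\sigma_{K_0}^2 = 100\). The sample size and seed below are arbitrary choices.

from scipy.stats import norm

# simulation sketch: 10,000 samples of size m from the theoretical model
m = 20  # illustrative sample size
samples = norm(loc=muK0, scale=sigmaK0).rvs(size=(10_000, m), random_state=44)
s2s = samples.var(axis=1, ddof=1)  # sample variances (n-1 denominator)
np.mean(s2s), sigmaK0**2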

Example 3X: test for the variance of Batch 02#

kombucha = pd.read_csv("../datasets/kombucha.csv")
ksample02 = kombucha[kombucha["batch"]==2]["volume"]
n02 = len(ksample02)
n02
20
obsvar02 = var(ksample02)
obsvar02
124.31760105263136

We can now compute the \(q\)-statistic, which is the observed sample variance estimate \(s_{\mathbf{k}_{02}}^2 = 124.32\) divided by the scale factor \(\tfrac{ \sigma_{K_0}^2 }{ n-1 }\).

obsq02 = (n02-1) * obsvar02 / sigmaK0**2
obsq02
23.62034419999996
from scipy.stats import chi2

rvX2 = chi2(df=n02-1)
pvalue02 = 1 - rvX2.cdf(obsq02)
pvalue02
0.2111207328360385

The \(p\)-value is large, so there is no reason to reject \(H_0\).

filename = os.path.join(DESTDIR, "p-value_right_tail_kombucha_obsq02.pdf")
_, ax = calc_prob_and_plot_tails(rvX2, 0, obsq02, xlims=[0,50])
ax.set_title(None)
ax.set_xlabel("$q$")
ax.set_ylabel("$f_{Q_0}$")
savefigure(ax, filename)
Saved figure to figures/stats/intro_to_NHST/p-value_right_tail_kombucha_obsq02.pdf
Saved figure to figures/stats/intro_to_NHST/p-value_right_tail_kombucha_obsq02.png
[Figure: the right-tail \(p\)-value as the shaded area of \(f_{Q_0}\) beyond obsq02.]

Example 4X: test for the variance of Batch 08#

kombucha = pd.read_csv("../datasets/kombucha.csv")
ksample08 = kombucha[kombucha["batch"]==8]["volume"]
n08 = len(ksample08)
n08
40
obsvar08 = var(ksample08)
obsvar08
169.9979220512824
obsq08 = (n08-1) * obsvar08 / sigmaK0**2
obsq08
66.29918960000013
from scipy.stats import chi2

rvX2 = chi2(df=n08-1)
pvalue08 = 1 - rvX2.cdf(obsq08)
pvalue08
0.0041211873587608805

The \(p\)-value is very small, so we reject \(H_0\). According to our analysis based on the sample variance from Batch 08, this batch seems to be irregular: it has an abnormally large variance.

Effect size estimates#

We can estimate the effect size using the formula \(\widehat{\Delta} = s^2_{\mathbf{k}_{08}} / \sigma_{K_0}^2\).

obsvar08 / sigmaK0**2
1.699979220512824

We can also obtain a confidence interval for the effect size \(\ci{\Delta,0.9}\) by first computing a 90% confidence interval for the population variance \(\ci{\sigma^2,0.9}\), then dividing by the theoretical variance \(\sigma_{K_0}^2\).

from ministats import ci_var
civar08 = ci_var(ksample08, alpha=0.1, method="a")
ci_l = civar08[0] / sigmaK0**2
ci_u = civar08[1] / sigmaK0**2
[ci_l, ci_u]
[1.2148888239061655, 2.58019779302895]

Compare this to the bootstrap confidence interval for the effect size \(\ci{\Delta,0.9}^* = [1.17, 2.18]\), which we obtained earlier in Section 3.3.

Alternative calculation methods#

Using scipy.stats.ttest_1samp for one-sample \(t\)-test#

# ALT. using existing function `scipy.stats`
from scipy.stats import ttest_1samp
res = ttest_1samp(ksample04, popmean=muK0)
res.pvalue
0.0037056653503329618
cimu04 = res.confidence_interval(confidence_level=0.9)
[cimu04.low - muK0, cimu04.high - muK0]
[1.7416639464577202, 5.925336053542196]

We see the \(p\)-value and the 90% confidence interval are the same as the ones we calculated earlier in Example 1T.

Bootstrap estimate of the standard error#

Another way to obtain the standard error of the mean (the standard deviation of the sampling distribution of the mean) is to use the bootstrap estimate.
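
A minimal sketch of this approach, assuming the ksample04 data from Example 1T is still loaded (the number of bootstrap samples B and the seed are arbitrary choices):

# bootstrap estimate of the standard error of the mean (sketch)
rng = np.random.default_rng(45)  # arbitrary seed
B = 5000                         # arbitrary number of bootstrap samples
bmeans = []
for _ in range(B):
    bsample = rng.choice(ksample04.values, size=n04, replace=True)
    bmeans.append(np.mean(bsample))
np.std(bmeans)  # should be close to sehat04 = 1.24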

See problems PNN and PMM in the notebook chapter3_problems.ipynb.

Explanations#

Helper function for calculating \(p\)-values#

def tailprobs(rvH0, obs, alt="two-sided"):
    """
    Calculate the probability of all outcomes of the random variable `rvH0`
    that are equal or more extreme than the observed value `obs`.
    """
    assert alt in ["greater", "less", "two-sided"]
    if alt == "greater":
        pvalue = 1 - rvH0.cdf(obs)
    elif alt == "less":
        pvalue = rvH0.cdf(obs)
    elif alt == "two-sided":  # assumes distribution is symmetric
        meanH0 = rvH0.mean()
        obsdev = abs(obs - meanH0)
        pleft = rvH0.cdf(meanH0 - obsdev)
        pright = 1 - rvH0.cdf(meanH0 + obsdev)
        pvalue = pleft + pright
    return pvalue

One-sided \(p\)-value calculation example#

Reusing the variables n02=20 and obsq02=23.62 we calculated in Example 3X above, we can calculate the \(p\)-value by calling the helper function tailprobs with the option alt="greater":

rvX2 = chi2(df=n02-1)
tailprobs(rvX2, obsq02, alt="greater")
0.2111207328360385

The value we obtained is identical to the value pvalue02 we obtained in Example 3X, so we know the function tailprobs works as expected for \(p\)-value calculations of type (a), where we want to detect only positive deviations from the expected distribution under \(H_0\).

Two-sided \(p\)-value calculation example#

Reusing the variable obsz04 we calculated in Example 1N above, we can calculate the \(p\)-value by calling the function tailprobs with the option alt="two-sided":

rvZ0 = norm(loc=0, scale=1)
tailprobs(rvZ0, obsz04, alt="two-sided")
0.015328711497996476

The value we obtained is identical to the value pvalue04 we obtained in Example 1N, so we know the function tailprobs works as expected for \(p\)-value calculations of type (c), where we want to detect both positive and negative deviations from the expected distribution under \(H_0\).
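
For completeness, left-tailed \(p\)-value calculations of type (b) use the option alt="less". For example (a calculation not shown in the original examples), reusing the variable obsz01 from Example 2N:

# left-tailed p-value: probability of outcomes less than or equal to obsz01
tailprobs(rvZ0, obsz01, alt="less")  # approx. 0.285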

Discussion#

Statistical modelling and assumptions#

Exercises#

Exercise NN: use integration to obtain the probability …#

from scipy.integrate import quad
fZ0 = rvZ0.pdf
oo = np.inf  # shorthand for infinity
# left tail                 +  right tail
quad(fZ0, -oo, -obsz04)[0]  +  quad(fZ0, obsz04, oo)[0]
0.015328711497995967