Statistical analysis examples

Statistical analysis examples#

This directory contains notebooks with self-contained examples of common statistical analysis techniques.

The purpose is to provide at least one example for each of the test covered in the Inventory of statistical test recipes.

List of recipes#

Z-tests:
- One sample \(z\)-test: one_sample_z-test.ipynb
Proportion tests
- One-sample \(z\)-test for proportions
- Binomial test
- Two-sample \(z\)-test for proportions
T-tests
- One-sample \(t\)-test: one_sample_t-test.ipynb
- Welch’s two-sample \(t\)-test: two_sample_t-test.ipynb
- Two-sample \(t\)-test with pooled variance (not important)
- Paired \(t\)-test
Chi-square tests
- Chi-square test for goodness of fit
- Chi-square test of independence
- Chi-square test for homogeneity
- Chi-square test for the population variance
ANOVA tests
- One-way analysis of variance (ANOVA): ANOVA.ipynb
- Two-way ANOVA
Nonparametric tests
- Sign test for the population median
- One-sample Wilcoxon signed-rank test
- Mann-Whitney U-test: Mann-Whitney_U-test.ipynb
- Kruskal–Wallis analysis of variance by ranks
Resampling methods
- Simulation tests
- Two-sample permutation test
- Permutation ANOVA
Miscellaneous tests
- Equivalence tests: two_sample_equivalence_test.ipynb
- Kolmogorov–Smirnov test
- Shapiro–Wilk normality test

Template#

For each statistical testing recipe, the notebook follows the same structure:

Data
Assumptions
Hypotheses
Power calculations
Test statistic
Sampling distribution
Examples
- Example 0: synthetic data when H0 is true
- Example 1: synthetic data when H0 is false
- Examples 2…n: other examples
Effect size estimates
Related tests
Discussion
Links

Why use synthetic data#

We use “fake” data for the examples 0 and 1 in order to illustrate the canonical data type each statistical test is designed to detect. This is a good “sanity check” to use for any statistical analysis technique: before trying on your real-world dataset, try it on synthetic data to make sure it works as expected (is able to detect a difference when a difference exists, and correctly fails to reject H0 when no difference exists).

Statistical analysis examples

Contents

Statistical analysis examples#

List of recipes#

Template#

Why use synthetic data#