Appendix E — Seaborn tutorial¶

See outline here:
https://docs.google.com/document/d/1fwep23-95U-w1QMPU31nOvUnUXE2X3s_Dbk5JuLlKAY/edit#bookmark=id.3i7cktuf1u3i

In this tutorial, we'll learn about Seaborn data visualizations. We'll discuss Seaborn plot functions We'll also describe the various options for customize plots' the appearance, add annotations, and export plots as publication-quality images.

If you want to pursue a career in a data-related field, I highly recommend you get to know Seaborn by reading this tutorial and the other resources in the links section.

Seaborn overview¶

The Seaborn library is a powerful toolbox for generating statistical data visualizations. Seaborn makes it very easy to visualize data stored in Pandas data frames. You can generate standard statistical plots like barplots, stripplots, scatterplots, using a single line of code. We'll look at a few examples of the Seaborn functions for generating statistical visualizations of data stored in Pandas data frames. The combination of the JupyterLab computational environment and the Python libraries Pandas and Seaborn is a best-in-class toolset for doing statistics in Python.

Seaborn includes numerous plot functions like stripplot, scatterplot, histplot, boxplot, barplot, and countplot. In this subsection, we'll show some examples of these function.

Basic plots¶

Line plot¶

In [1]:

Copied!

import seaborn as sns
import pandas as pd
import seaborn as sns
import pandas as pd

In [2]:

Copied!

days = [1, 2, 3, 4]
cakes = [2, 5, 3, 4]
sns.lineplot(x=days, y=cakes);
days = [1, 2, 3, 4]
cakes = [2, 5, 3, 4]
sns.lineplot(x=days, y=cakes);

No description has been provided for this image

In [3]:

Copied!





# # (optional) use Matplotlib axis methods to add labels
# ax = sns.lineplot(x=days, y=cakes)
# ax.set_xlabel("days")
# ax.set_ylabel("cakes")
# # (optional) use Matplotlib axis methods to add labels
# ax = sns.lineplot(x=days, y=cakes)
# ax.set_xlabel("days")
# ax.set_ylabel("cakes")

In [4]:

Copied!

df = pd.DataFrame({"days":days, "cakes":cakes})
df
df = pd.DataFrame({"days":days, "cakes":cakes})
df

Out[4]:

	days	cakes
0	1	2
1	2	5
2	3	3
3	4	4

In [5]:

Copied!

df.columns
df.columns

Out[5]:

Index(['days', 'cakes'], dtype='object')

In [11]:

Copied!

sns.lineplot(x="days", y="cakes", data=df);
sns.lineplot(x="days", y="cakes", data=df);

In [7]:

Copied!

# # ALT. hybrid approach
# sns.lineplot(x=df["days"], y=df["cakes"])
# # ALT. hybrid approach
# sns.lineplot(x=df["days"], y=df["cakes"])

Plotting function graphs¶

In [8]:

Copied!

def g(x):
    return 0.5 * x**2
def g(x):
    return 0.5 * x**2

In [9]:

Copied!





import numpy as np
xs = np.linspace(0, 10, 1000)
gxs = g(xs)
sns.lineplot(x=xs, y=gxs, label="Graph of g(x)");
import numpy as np
xs = np.linspace(0, 10, 1000)
gxs = g(xs)
sns.lineplot(x=xs, y=gxs, label="Graph of g(x)");

In [10]:

Copied!





# # FIGURES ONLY
# from ministats.utils import savefigure
# ax = sns.lineplot(x=xs, y=gxs, label="Graph of g(x)");
# filename = "figures/tutorials/seaborn/graph_of_function_g_eq_halfx2.pdf"
# savefigure(ax, filename)
# # FIGURES ONLY
# from ministats.utils import savefigure
# ax = sns.lineplot(x=xs, y=gxs, label="Graph of g(x)");
# filename = "figures/tutorials/seaborn/graph_of_function_g_eq_halfx2.pdf"
# savefigure(ax, filename)

Distribution plots¶

Strip plots¶

Scatter plots¶

Density plots¶

Histograms¶

Box plots¶

Violin plots¶

Categorical plots¶

Bar plots¶

Linear model plots¶

Linear model plots using `seaborn`¶

Linear model plots from scratch¶

Linear model plots using `statsmodels`¶

Other plots¶

Stem plot for discrete random variables¶

Customizing plots¶

Bonus topics¶

Data visualization tips¶

Links¶

Here are some links to learning resources for Seaborn and data visualization techniques.

Official docs¶

An introduction to seaborn
http://seaborn.pydata.org/introduction.html
Seaborn tutorials featuring lots of useful plot examples
https://seaborn.pydata.org/tutorial.html
Gallery of data visualizations produced using Seaborn
https://seaborn.pydata.org/examples/index.html

Tutorials¶

Python Seaborn Tutorial For Beginners
https://www.datacamp.com/community/tutorials/seaborn-python-tutorial
The Ultimate Python Seaborn Tutorial
https://elitedatascience.com/python-seaborn-tutorial
Python Seaborn Tutorial
https://www.geeksforgeeks.org/python-seaborn-tutorial/

Video tutorials¶

Intro to Seaborn by Kimberly Fessel (excellent!)
https://www.youtube.com/playlist?list=PLtPIclEQf-3cG31dxSMZ8KTcDG7zYng1j
see also notebooks from the videos.
Seaborn Tutorial 2021 by Derek Banas
https://www.youtube.com/watch?v=6GUZXDef2U0
Data Visualisation with Seaborn Crash Course by Valentine Mwangi
https://www.youtube.com/watch?v=zafPvR4MmBA See also the colab notebook for the course.