In this example I demonstrate cleaning data to protect participant anonymity, NumPy statistical tests, and usage of seaborn to plot data

Data retrieved from: https://www.kaggle.com/steveahn/memory-test-on-drugged-islanders-data

When we call to see the DataFrame we immediately see that participants data is exposing their name which is almost always unacceptable, lets fix that.

Lets now group the dataframe by Happy/Sad condition, drug and participant, lets see what the min, max, and mean is

Using NumPy we can easily see the shape and dimensions of the new dataframe and also tell what type of data is stored in the Diff category.

Lastly it would be really nice to plot this data! Using seaborn we set the theme to show a whitegrid, next we use the sns.boxplot() method to plot a boxplot of the data, this is done by specifying the x and y to be the dataframe columns we want, then setting the hue to be the next index column at which we are filtering the data in the boxplot at, and lastly we tell seaborn's boxplot method that our data is coming from our dataframe df. Calling the plt (matplotlib.pyplot) method plot() displays our graph. This show my experience and mastery over Seaborn categorical plots.