Data Visualization

Seaborn

Hyerang Raina Kim

4 min readAug 14, 2020

✔️ Having identical statistics doesn’t mean the individual data sets are all equal!!

Visualizing data by types: Numeric x Numeric

Visualizing data by types: Numeric x Categorical

import seaborn as sns# example datasets are given by seaborn. Imported datasets can be used just the same as we used earlier with pandas.raw = sns.load_datatset('tips')

Seaborn Basic function structure

sns.scatterplot(data=dataframe, x='total_bill', y='tip', hue='sex')

Data Distribution (Numeric vs. Numeric)

relplot(data=dataframe, x=<column>, y=<column>, hue=<column>, kind='scatter)

kind options: ‘scatter’(default), ‘line’

sns.relplot(data=raw, x='tip', y='total_bill')

jointplot(data = df, x = <coloumn>, y=<column>, kind = 'scatter)

kind options

❓ ‘scatter’(default): point

❓ ‘reg’: point + regression

❓ ‘kde’: cumulative distribution chart like map

sns.jointplot(data = raw, x = 'tip', y = 'total_bill')

sns.jointplot(data = raw, x = 'tip', y = 'total_bill', kind = 'kde')

sns.jointplot(data = raw, x = 'tip', y = 'total_bill', kind = 'regg')

sns.jointplot(data = raw, x = 'tip', y = 'total_bill', kind = 'hex')

Pairplot(data = df)

Visualize the relationship between each two column in the entire numeric data column in data frame

sns.pairplot(data = raw)

sns.pairplot(data = raw, hue = 'sex')

Data Distribution (Numeric vs. Categorical)

sns.boxplot(data = raw, x = 'day', y = 'tip)

The line in the box indicates where the datasets are heavily weighted and the dots above (could be placed at the bottom) indicates unusual data.

sns.boxplot(data = raw, x = 'day, y = 'tip', hue = 'smoker')

Boxplot does not show the individual value for each data so if the amount of data is low, we can not roughly estimate with boxplot.

sns.swarmplot(data = raw, x = 'day', y = 'tip')

sns.swarmplot(data = raw, x = 'day', y = 'tip', hue = 'smoker', dodge = True)

sns.barplot(data = raw, x = 'size', y = 'tip')

sns.barplot(data = raw, x = 'size', y = 'tip', hue = 'sex')

Data Distribution (Numeric vs. Categorical vs. Categorical)

If using heatmap, we can see the entire two categorical data distribution of numerical data value all in one by using color.

df = raw.pivot_table(index = 'day', columns = 'size', values = 'tip', aggfunc = 'mean')

sns.heatmap(data = df)

sns.heatmap(data = df, annot = True)

sns.heatmap(data = df, annot = True, fmt = '.2f')

sns.heatmap(data = df, annot = True, fmt = '.2f', cmap = 'Blues')

fmt options: ‘.1f’, ‘.2f’, ‘.3f’ …
cmap options: Reds, Blues, vlag, Pastel1

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Data Visualization

Seaborn

Python

Written by Hyerang Raina Kim

0 Followers

1 Following

No responses yet

Write a response

What are your thoughts?

Also publish to my profile

More from Hyerang Raina Kim

Hyerang Raina Kim

Javascript Quick Refresher

Syntax Overview

Aug 5, 2020

Hyerang Raina Kim

Dynamic Routes & Advanced Models

Passing and Using Dynamic Data

Aug 28, 2020

Hyerang Raina Kim

Cart Model

nodejs

Aug 28, 2021

Hyerang Raina Kim

Switch from sqlite3 to PostgreSQL

Ruby on Rails — Onkey Website Project

Sep 2, 2021

See all from Hyerang Raina Kim

Recommended from Medium

CodeX

Andrew W. Pearson

The 15 Principles of Data Visualization

“Having all the information in the world at our fingertips doesn’t make it easier to communicate: it makes it harder,” says Cole Nussbaumer…

Aug 25, 2023

Top Tips for Creating a Great Dashboard: A Guide for Effective Data Visualization

Rita Angelou

Top Tips for Creating a Great Dashboard: A Guide for Effective Data Visualization

Human brain loves pictures. Psychologist Albert Mehrabian demonstrated that 93% of communication is nonverbal. Research at 3M Corporation…

Oct 5, 2024

Lists

Coding & Development

11 stories1033 saves

Predictive Modeling w/ Python

20 stories1856 saves

Practical Guides to Machine Learning

10 stories2225 saves

ChatGPT prompts

51 stories2643 saves

The Power of Iconography in Data Reporting and Visualization

Santhana Lakshmi Ponnurasan

The Power of Iconography in Data Reporting and Visualization

Icons have become a highly favored element in report and dashboard design. These visual symbols can communicate meaning swiftly and…

Feb 10

Creating a Venn Diagram Style Sales KPI in Power BI

Microsoft Power BI

Shashanka Shekhar

Creating a Venn Diagram Style Sales KPI in Power BI

Creating insightful and visually appealing sales KPIs (Key Performance Indicators) is crucial for data-driven decision-making in any…

Mar 3

HR Analytics Dashboard: Leveraging Python & Tableau for Data-Driven Insights

Arthur Nweke-Uchebo

HR Analytics Dashboard: Leveraging Python & Tableau for Data-Driven Insights

Data is the key to better hiring, performance evaluation, and salary decisions in today's workplace. HR professionals increasingly rely on…

3d ago

Data Visualization for Exploratory Data Analysis (EDA) in Python

Python Fundamentals

Data Visualization for Exploratory Data Analysis (EDA) in Python

Python Data Visualization Guide

Feb 25, 2024

See more recommendations

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams