이번에는 Seaborn의 다음과 같은 그래프에 대해 알아봅시다.

학습 활동은 아래를 참고하세요.

Q1 다음 각각이 의미하는 바는 무엇인가요?

In [1]:

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

Q2 sns.load_dataset()은 무엇을 하는 함수인가요?

In [2]:

titanic = sns.load_dataset('titanic')
titanic

Out[2]:

	survived	pclass	sex	age	sibsp	parch	fare	embarked	class	who	adult_male	deck	embark_town	alive	alone
0	0	3	male	22.0	1	0	7.2500	S	Third	man	True	NaN	Southampton	no	False
1	1	1	female	38.0	1	0	71.2833	C	First	woman	False	C	Cherbourg	yes	False
2	1	3	female	26.0	0	0	7.9250	S	Third	woman	False	NaN	Southampton	yes	True
3	1	1	female	35.0	1	0	53.1000	S	First	woman	False	C	Southampton	yes	False
4	0	3	male	35.0	0	0	8.0500	S	Third	man	True	NaN	Southampton	no	True
...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...
886	0	2	male	27.0	0	0	13.0000	S	Second	man	True	NaN	Southampton	no	True
887	1	1	female	19.0	0	0	30.0000	S	First	woman	False	B	Southampton	yes	True
888	0	3	female	NaN	1	2	23.4500	S	Third	woman	False	NaN	Southampton	no	False
889	1	1	male	26.0	0	0	30.0000	C	First	man	True	C	Cherbourg	yes	True
890	0	3	male	32.0	0	0	7.7500	Q	Third	man	True	NaN	Queenstown	no	True

891 rows × 15 columns

In [3]:

tips = sns.load_dataset('tips')

선형 회귀 그래프

lmplot은 column 간의 선형관계를 확인하기에 용이한 차트입니다.

Q3 선형 회귀란 무엇인가요?

In [4]:

sns.lmplot(x="total_bill", y="tip", height=8, data=tips)
plt.show()

Q4 위 그래프에서 x와 y는 무엇을 의미하나요? height=와 data=는 또한 무엇인가요?

Q5 만약 축의 제목을 바꾸고 싶다면 어떻게 해야 할까요?

In [5]:

sns.lmplot(x="total_bill", y="tip", hue="smoker", height=8, data=tips)
plt.show()

Q6 hue= 옵션은 어떤 결과를 가져왔나요?

In [6]:

sns.lmplot(x='total_bill', y='tip', hue='smoker', col='day', col_wrap=2, height=8, data=tips)
plt.show()

Q7 col= 옵션은 어떤 결과를 가져왔나요?

Q8 col_wrap=은 무엇을 하는 옵션인가요?

In [7]:

sns.relplot(x="total_bill", y="tip", hue="day", data=tips)
plt.show()

Q9 다중관계도표는 무엇을 하는 도표인가요?

Q10 x=,y=,hue=는 각각 무엇을 의미하나요?

In [8]:

sns.relplot(x="total_bill", y="tip", hue="day", col="time", data=tips)
plt.show()

Q11 선형 회귀와 비교했을 때 다중관계도표가 가지는 강점과 약점은 무엇인가요?

In [9]:

sns.relplot(x="total_bill", y="tip", hue="day", row="sex", col="time", data=tips)
plt.show()

Q12 위의 다중관계도표를 해석해보세요.

In [10]:

sns.relplot(x="total_bill", y="tip", hue="day", row="sex", col="time", palette='CMRmap_r', data=tips)
plt.show()

Q13 인터넷을 검색해서 platte=옵션에 더 올 수 있는 색상표를 찾아보세요.

In [11]:

sns.jointplot(x="total_bill", y="tip", height=8, data=tips)
plt.show()

Q14 분포산점도란 무엇인가요?

In [12]:

sns.jointplot("total_bill", "tip", height=8, data=tips, kind="reg")
plt.show()

Q15 옵션 kind='reg'는 어떤 결과를 만들었나요?

In [13]:

sns.jointplot("total_bill", "tip", height=8, data=tips, kind="hex")
plt.show()

Q16 밀도 폴리곤은 왜 필요할까요? 장점은 무엇인가요?

In [14]:

iris = sns.load_dataset('iris')
sns.jointplot("sepal_width", "petal_length", height=8, data=iris, kind="kde", color="g")
plt.show()

Q17 밀도 등고선은 왜 필요한가요? 장점은 무엇인가요?