3.4pandas處理丟失數據
頭文件:
import numpy as np
import pandas as pd
丟棄數據部分:
dates = pd.date_range('20130101',periods=6)
df = pd.DataFrame(np.random.randn(6,4),index=dates,columns=['A','B','C','D'])
df.iloc[0,1] = np.nan
df.iloc[1,2] = np.nanprint(df.dropna(axis=0,how='any'))
#how = any是只要有nan就丟掉,如果how = all是只有全部是nan才丟掉
把nan的地方填0
print(df.fillna(value=0))
查看數據中各個位置是否是缺失
print(df.isnull)
查看數據中是否有缺失
print(np.any(df.isnull())==True)
3.5pandas導入導出
頭文件:
import pandas as pd
數據導入
data = pd.read_csv('student.csv')
數據導出
data.to_csv('student.csv')