Combining tables
pd.concat([df1, df2, df3], axis=0)
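As a minimal sketch of what `pd.concat` does along each axis, the example below builds three small frames. The column names and values are made up for illustration, and `ignore_index=True` is an optional extra that the one-liner above does not use.

```python
import pandas as pd

# Hypothetical frames that share the same columns.
df1 = pd.DataFrame({"id": [1, 2], "score": [90, 85]})
df2 = pd.DataFrame({"id": [3], "score": [70]})
df3 = pd.DataFrame({"id": [4, 5], "score": [60, 95]})

# axis=0 stacks the frames row-wise; ignore_index=True renumbers rows from 0.
stacked = pd.concat([df1, df2, df3], axis=0, ignore_index=True)

# axis=1 places frames side by side, aligning them on the row index.
side_by_side = pd.concat([df1, df3], axis=1)

print(stacked)
print(side_by_side)
```

Without `ignore_index=True`, the stacked result keeps each frame's original row labels, so labels can repeat.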
pd.merge(left, right, how='inner', left_on=None, right_on=None, left_index=False, right_index=False, ...)
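The effect of `how` and of `left_on` / `right_on` is easier to see on concrete data. The sketch below assumes two small frames with a hypothetical `key` column; the names `right_renamed` and `code` are invented for the example.

```python
import pandas as pd

# Hypothetical tables that share a "key" column.
left = pd.DataFrame({"key": ["a", "b", "c"], "x": [1, 2, 3]})
right = pd.DataFrame({"key": ["b", "c", "d"], "y": [20, 30, 40]})

# Inner join keeps only keys present in both tables.
inner = pd.merge(left, right, how="inner", on="key")

# Outer join keeps every key and fills the gaps with NaN.
outer = pd.merge(left, right, how="outer", on="key")

# left_on / right_on handle key columns that have different names.
right_renamed = right.rename(columns={"key": "code"})
by_names = pd.merge(left, right_renamed, how="left",
                    left_on="key", right_on="code")

print(inner)
print(outer)
print(by_names)
```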
Pivot tables
pd.pivot_table(df, index='y', columns='x', values='z', aggfunc='mean')
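A small worked example may help here. It assumes a frame whose columns are literally named `y`, `x`, and `z`, with made-up values. The aggregation is passed as the string `'mean'`, since newer pandas releases emit a warning when a NumPy function such as `np.mean` is passed instead.

```python
import pandas as pd

# Hypothetical long-format data: one row per observation.
df = pd.DataFrame({
    "y": ["a", "a", "b", "b"],
    "x": ["p", "q", "p", "p"],
    "z": [1.0, 2.0, 3.0, 5.0],
})

# Rows come from "y", columns from "x", and each cell is the mean of "z".
table = pd.pivot_table(df, index="y", columns="x", values="z", aggfunc="mean")

print(table)
```

Combinations that never occur in the data (here `y == "b"` with `x == "q"`) show up as NaN unless `fill_value` is set.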
Cross-tabulation (crosstab)
pd.crosstab(index=df['y'], columns=df['x'], values=df['z'], aggfunc='...', margins=True)  # margins: True or False
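Unlike `pivot_table`, `pd.crosstab` takes the data itself (Series or arrays) rather than column names. The sketch below shows a plain frequency table and an aggregated one. The frame and its values are hypothetical, and `'sum'` is only one possible choice of `aggfunc`.

```python
import pandas as pd

# Hypothetical data with two categorical columns and one numeric column.
df = pd.DataFrame({
    "y": ["a", "a", "b", "b"],
    "x": ["p", "q", "p", "p"],
    "z": [1.0, 2.0, 3.0, 5.0],
})

# With no values/aggfunc, crosstab counts occurrences;
# margins=True adds an "All" row and column of totals.
counts = pd.crosstab(index=df["y"], columns=df["x"], margins=True)

# values and aggfunc must be given together; here each cell sums "z".
sums = pd.crosstab(index=df["y"], columns=df["x"],
                   values=df["z"], aggfunc="sum")

print(counts)
print(sums)
```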
Reading and writing data
pd.read_csv(filepath, sep=',')
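The heading covers both reading and writing, so here is a hedged round-trip sketch. It writes to an in-memory buffer rather than a real file so that it runs anywhere. The frame contents are invented, and `DataFrame.to_csv` is used as the writing counterpart of `read_csv`.

```python
import io
import pandas as pd

# Hypothetical data to save and load again.
df = pd.DataFrame({"name": ["Ann", "Bob"], "age": [31, 25]})

# With no path, to_csv returns the CSV text; index=False omits the row index.
csv_text = df.to_csv(index=False, sep=",")

# read_csv accepts a file path or any file-like object.
loaded = pd.read_csv(io.StringIO(csv_text), sep=",")

print(csv_text)
print(loaded)
```

In everyday use the buffer would be replaced by a path, for example `df.to_csv(filepath, index=False)` followed by `pd.read_csv(filepath)`.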
Creating a DataFrame or Series
d = pd.DataFrame(data, index)
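For illustration, `data` can be a dict of equal-length lists and `index` a list of row labels. Both are hypothetical values chosen for this sketch, and a `Series` is built the same way from a single sequence.

```python
import pandas as pd

# Hypothetical inputs: a dict of columns and a list of row labels.
data = {"col1": [1, 2, 3], "col2": ["a", "b", "c"]}
index = ["r1", "r2", "r3"]

# A DataFrame is a table of labeled columns.
d = pd.DataFrame(data, index=index)

# A Series is a single labeled column.
s = pd.Series([10, 20, 30], index=index, name="value")

print(d)
print(s)
```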
Common operations
df.loc selects by label
df.iloc selects by integer position
# Slicing
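One detail worth seeing in code is that label slices with `loc` include both endpoints, while position slices with `iloc` exclude the stop, as ordinary Python slices do. The frame below is a made-up example.

```python
import pandas as pd

# Hypothetical frame with string row labels.
df = pd.DataFrame(
    {"a": [1, 2, 3, 4], "b": [10, 20, 30, 40]},
    index=["w", "x", "y", "z"],
)

# loc: slice by label; both "x" and "y" are included.
by_label = df.loc["x":"y", "a"]

# iloc: slice by position; row 3 is excluded.
by_position = df.iloc[1:3, 0]

# The same start and stop can therefore select a different number of rows.
three_rows = df.loc["w":"y"]
two_rows = df.iloc[0:2]

print(by_label)
print(by_position)
```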
Setting the index
df.set_index(keys=['col1', 'col2'], drop=True, append=False, inplace=True)
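A short sketch of `set_index` on hypothetical data follows. It returns a new frame instead of using `inplace=True`, which is a stylistic choice for the example and not a requirement. `reset_index` is shown as the inverse operation.

```python
import pandas as pd

# Hypothetical frame; col1 and col2 together identify each row.
df = pd.DataFrame({
    "col1": ["a", "a", "b"],
    "col2": [1, 2, 1],
    "val": [0.1, 0.2, 0.3],
})

# drop=True removes col1/col2 from the columns once they become the index;
# append=False replaces the existing index instead of extending it.
indexed = df.set_index(keys=["col1", "col2"], drop=True, append=False)

# A tuple selects one row from the two-level index.
value = indexed.loc[("a", 2), "val"]

# reset_index moves the index levels back into ordinary columns.
restored = indexed.reset_index()

print(indexed)
print(value)
```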
Multi-level indexes (MultiIndex)
# Using a three-level index as an example
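The original snippet for this section is not shown, so the following is only an assumed illustration of a three-level index. The level names (`year`, `region`, `product`) and the values are invented.

```python
import pandas as pd

# Build a three-level index from every combination of the level values.
idx = pd.MultiIndex.from_product(
    [["2023", "2024"], ["east", "west"], ["A", "B"]],
    names=["year", "region", "product"],
)
s = pd.Series(range(8), index=idx, name="sales")

# A full key (one value per level) returns a single element.
one = s.loc[("2024", "west", "B")]

# A key for the outer level returns everything beneath it,
# with the two remaining levels as the index.
sub = s.loc["2023"]

# xs selects on an inner level without specifying the outer ones.
cross = s.xs("east", level="region")

print(one)
print(sub)
print(cross)
```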
Grouped computation with groupby
# groupby
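As a minimal sketch of grouped computation, the example below splits a hypothetical frame by a `team` column and aggregates a `points` column. The column names and numbers are assumptions made for the example.

```python
import pandas as pd

# Hypothetical data: several rows per team.
df = pd.DataFrame({
    "team": ["a", "b", "a", "b", "a"],
    "points": [1, 2, 3, 4, 5],
})

# One aggregate per group: the result is a Series indexed by team.
totals = df.groupby("team")["points"].sum()

# Several aggregates at once: the result has one column per function.
summary = df.groupby("team")["points"].agg(["mean", "max"])

print(totals)
print(summary)
```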