Pandas常用操作
数据读入:
data=pd.read_csv("E:/all_data.csv, header=None, sep=" "")数据存储(无表头):
data.to_csv('E://AmazonData/data.csv', header=None, index=0)获取每个用户有多少个:
user_num = pd.DataFrame(user.value_counts())挑出不满足交易数量为1的用户:
multi_user = user_num[(~ user_num['user'].isin([1]))]剪切特定的列:
new_data = data.loc[data['user'].isin(multi_user.index)]删除特定的列:
new_data = new_data.drop(columns=['Unnamed: 0'])添加特定的列:
new_counts['userID'] = new_counts.index画直方图:
fig, ax = pyplot.subplots(figsize=(20, 6))sns.distplot(amazon_user, ax=ax)