Common Pandas Operations

  • Read data (space-separated file with no header row):

    import pandas as pd
    data = pd.read_csv("E:/all_data.csv", header=None, sep=" ")

  • Save data (no header row, no index column):

    data.to_csv('E://AmazonData/data.csv', header=False, index=False)
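A minimal round-trip sketch of the read and save steps above. The sample frame, the temporary path, and the column names `user` and `item` are assumptions made for illustration; they do not come from the original data.

```python
import os
import tempfile

import pandas as pd

# Hypothetical sample data standing in for all_data.csv.
sample = pd.DataFrame({"user": ["u1", "u2", "u1"], "item": ["a", "b", "c"]})

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "data.csv")
    # Write without a header row and without the index column.
    sample.to_csv(path, header=False, index=False, sep=" ")
    # header=None tells pandas that the first line is data, not column names.
    restored = pd.read_csv(path, header=None, sep=" ")

# A headerless file yields integer column labels (0, 1, ...); name them explicitly.
restored.columns = ["user", "item"]
print(restored)
```

Writing and reading with the same `sep` keeps the round trip lossless. Without `index=False`, the index is written as an extra first column, which shows up as `Unnamed: 0` when a file that does have a header row is read back.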

  • Count the number of records per user:

    user_num = pd.DataFrame(user.value_counts())

  • Select users whose transaction count is not 1 (in pandas 2.0+ the counts column is named 'count' rather than 'user'):

    multi_user = user_num[~user_num['user'].isin([1])]

  • Keep only the rows that belong to the selected users:

    new_data = data.loc[data['user'].isin(multi_user.index)]
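The counting and filtering steps above can be sketched end to end as follows. The sample data is hypothetical. This version works on the `value_counts()` Series directly instead of wrapping it in a DataFrame, an assumption chosen so the code does not depend on how a given pandas version names the counts column.

```python
import pandas as pd

# Hypothetical transaction log: one row per transaction.
data = pd.DataFrame({
    "user": ["u1", "u2", "u1", "u3", "u3", "u3"],
    "item": ["a", "b", "c", "d", "e", "f"],
})

# Series indexed by user, holding the number of rows for each user.
user_counts = data["user"].value_counts()

# Keep the users whose transaction count is not 1.
multi_user = user_counts[user_counts != 1]

# Keep only the rows that belong to those users.
new_data = data.loc[data["user"].isin(multi_user.index)]
print(new_data)
```

Here `u2` has a single transaction, so its row is dropped while all rows for `u1` and `u3` are kept.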

  • Drop specific columns:

    new_data = new_data.drop(columns=['Unnamed: 0'])

  • Add a specific column:

    new_counts['userID'] = new_counts.index
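A short sketch of dropping and adding columns on small, made-up frames. The frame contents and the `count` column are assumptions for illustration; only `Unnamed: 0` and `userID` come from the notes above.

```python
import pandas as pd

# Hypothetical frame with a leftover index column from an earlier CSV export.
new_data = pd.DataFrame({
    "Unnamed: 0": [0, 1, 2],
    "user": ["u1", "u1", "u3"],
})

# errors="ignore" makes the drop a no-op when the column is absent.
new_data = new_data.drop(columns=["Unnamed: 0"], errors="ignore")

# Hypothetical per-user counts, indexed by user.
new_counts = pd.DataFrame({"count": [2, 1]}, index=["u1", "u3"])

# Copy the index into a regular column.
new_counts["userID"] = new_counts.index

# Optionally replace the old index with a default 0..n-1 range.
new_counts = new_counts.reset_index(drop=True)
print(new_data)
print(new_counts)
```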

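  • Plot a histogram (sns.distplot is deprecated since seaborn 0.11; sns.histplot is its replacement):

    from matplotlib import pyplot
    import seaborn as sns
    fig, ax = pyplot.subplots(figsize=(20, 6))
    sns.distplot(amazon_user, ax=ax)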