Pandas 合并多个 csv 文件

Question

提问by warrenfitzhenry

I have multiple csv files that I would like to combine into one df.

我有多个 csv 文件，我想将它们合并为一个 df。

They are all in this general format, with two index columns:

它们都是这种通用格式，有两个索引列：

                                           1     2
CU0112-005287-7 Output Energy, (Wh/h)   0.064   0.066
CU0112-005287-7 Lights (Wh)                0     0

                                            1     2
CU0112-001885-L Output Energy, (Wh/h)   1.33    1.317
CU0112-001885-L Lights (Wh)             1.33    1.317

and so on...

等等...

The combined df would be:

合并后的 df 将是：

                                           1     2
CU0112-005287-7 Output Energy, (Wh/h)   0.064   0.066
CU0112-005287-7 Lights (Wh)                0     0
CU0112-001885-L Output Energy, (Wh/h)   1.33    1.317
CU0112-001885-L Lights (Wh)             1.33    1.317

I am trying this code:

我正在尝试这个代码：

import os
import pandas as pd
import glob

files = glob.glob(r'2017-12-05\Aggregated\*.csv')   //folder which contains all the csv files

df = pd.merge([pd.read_csv(f, index_col=[0,1])for f in files], how='outer')

df.to_csv(r'\merged.csv')

But I am getting this error:

但我收到此错误：

TypeError: merge() takes at least 2 arguments (2 given)

Answer 1

回答by jezrael

I think you need concatinstead merge:

我认为你需要concat，而不是merge：

df = pd.concat([pd.read_csv(f, index_col=[0,1]) for f in files])

Answer 2

回答by Yayati Sule

You can try the following. I made some changes to the DataFrame combining logic

您可以尝试以下操作。我对 DataFrame 组合逻辑进行了一些更改

import os
import pandas as pd
import glob

files = glob.glob(r'2017-12-05\Aggregated\*.csv')   //folder which contains all the csv files

df = reduce(lambda df1,df2: pd.merge(df1,df2,on='id',how='outer'),[pd.read_csv(f, index_col=[0,1])for f in files] )

df.to_csv(r'\merged.csv')

Answer 3

回答by Billy Bonaros

A simple way:

一个简单的方法：

Creating a list with the names of csvs:

创建一个带有 csvs 名称的列表：

files=listdir()
csvs=list()
for file in files:
    if file.endswith(".csv"):
        csvs.append(file)

concatenate the csvs:

连接 csvs：

data=pd.DataFrame()
for i in csvs:
    table=pd.read_csv(i)
    data=pd.concat([data,table])

Pandas 合并多个 csv 文件

提问by warrenfitzhenry

回答by jezrael

回答by Yayati Sule

回答by Billy Bonaros

相关推荐

最近更新

标签

Pandas 合并多个 csv 文件

提问by warrenfitzhenry

回答by jezrael

回答by Yayati Sule

回答by Billy Bonaros

相关推荐

Python pandas：同时在不同列上均值和求和分组

Pandas Dataframe：如何按索引选择一行，然后获取接下来的几行

Pandas 匹配多列并将匹配值作为单个新列获取

pandas 从 Google 财经下载股票价格

相关推荐

最近更新

标签