pandas Python：如何将数据框字典变成一个大数据框，其中列名是前一个字典的键？

Question

提问by pakkunrob

So my dataframe is made from lots of individual excel files, each with the the date as their file name and the prices of the fruits on that day in the spreadsheet, so the spreadsheets look something like this:

所以我的数据框是由许多单独的 excel 文件组成的，每个文件的文件名都是日期，电子表格中还有当天水果的价格，所以电子表格看起来像这样：

15012016:
Fruit     Price
Orange    1
Apple     2
Pear      3

16012016:
Fruit     Price
Orange    4
Apple     5
Pear      6

17012016:
Fruit     Price
Orange    7
Apple     8
Pear      9

So to put all that information together I run the following code to put all the information into a dictionary of dataframes (all fruit price files stored in 'C:\Fruit_Prices_by_Day'

因此，为了将所有信息放在一起，我运行以下代码将所有信息放入数据框字典中（所有水果价格文件都存储在“C:\Fruit_Prices_by_Day”中）

#find all the file names
file_list = []
for x in os.listdir('C:\Fruit_Prices_by_Day'):
    file_list.append(x) 

file_list= list(set(file_list))

d = {}

for date in Raw_list:
    df1 = pd.read_excel(os.path.join('C:\Fruit_Prices_by_Day', date +'.xlsx'), index_col = 'Fruit')
    d[date] = df1

Then this is the part where I'm stuck. How do I then make this dict into a dataframe where the column names are the dict keys i.e. the dates, so I can get the price of each fruit per day all in the same dataframe like:

然后这是我被卡住的部分。然后我如何将这个 dict 变成一个数据框，其中列名是 dict 键，即日期，这样我就可以在同一个数据框中获得每天每个水果的价格，例如：

          15012016   16012016   17012016   
Orange    1          4          7
Apple     2          5          8
Pear      3          6          9

Answer 1

回答by jezrael

You can try first set_indexof all dataframes in comprehensionand then use concatwith remove last level of multiindexin columns:

您可以首先尝试输入set_index所有数据框comprehension，然后使用concat删除multiindex列的最后一级：

 print d
{'17012016':     Fruit  Price
0  Orange      7
1   Apple      8
2    Pear      9, '16012016':     Fruit  Price
0  Orange      4
1   Apple      5
2    Pear      6, '15012016':     Fruit  Price
0  Orange      1
1   Apple      2
2    Pear      3}
d = { k: v.set_index('Fruit') for k, v in d.items()}

df = pd.concat(d, axis=1)
df.columns = df.columns.droplevel(-1) 
print df
        15012016  16012016  17012016
Fruit                               
Orange         1         4         7
Apple          2         5         8
Pear           3         6         9

Answer 2

回答by Igor Fobia

Something like this could work: loop over the dictionary, add the constant column with the dictionary key, concatenate and then set the date as index

这样的事情可以工作：循环字典，使用字典键添加常量列，连接然后将日期设置为索引

pd.concat(
    (i_value_df.assign(date=i_key) for i_key, i_value_df in d.items())
).set_index('date')

pandas Python：如何将数据框字典变成一个大数据框，其中列名是前一个字典的键？

提问by pakkunrob

回答by jezrael

回答by Igor Fobia

相关推荐

最近更新

标签

pandas Python：如何将数据框字典变成一个大数据框，其中列名是前一个字典的键？

提问by pakkunrob

回答by jezrael

回答by Igor Fobia

相关推荐

使用 python pandas 对大型 csv 文件的汇总统计

pandas ipython笔记本中的熊猫子图标题大小

Pandas 数据框：按两列分组，然后对另一列求平均值

将 psycopg2 DictRow 查询转换为 Pandas 数据框

相关推荐

最近更新

标签