使用 Pandas 中列的唯一值创建 DataFrame

Question

提问by Felipe Amaral Rodrigues

I'm new at Python and Pandas and I'm having troubles to solve a problem, I have a DF with multiple variables, as the example bellow:

我是 Python 和 Pandas 的新手，我在解决问题时遇到了麻烦，我有一个带有多个变量的 DF，如下例所示：

SRC Data1 Data2
AAA  180   122
BBB  168   121
CCC  165   147
DDD  140   156
EEE  152   103
AAA  170   100
CCC  166   112
DDD  116   155
EEE  179   119

And I'm expecting something like:

我期待这样的事情：

DF_A

SRC    Data1   Data2
AAA    180     122
AAA    170     100

DF_B

SRC    Data1   Data2
BBB     168     121

What I need is create a DF to each value in SRCand carry their respective data in Data1and Data2

我需要的是为SRC 中的每个值创建一个 DF并在Data1和Data2 中携带它们各自的数据

I have alredy use pd.DataFrame(Example.SRC.unique()) and get each unique values in SRCbut I don't know if this will help me.

我已经使用 pd.DataFrame(Example.SRC.unique()) 并在SRC 中获取每个唯一值，但我不知道这是否对我有帮助。

Thank you all!

谢谢你们！

Answer 1

回答by Andy Hayden

The neat way to do this is dict(iter(g)):

这样做的巧妙方法是dict(iter(g))：

In [11]: g = df.groupby("SRC", as_index=False)

In [12]: d = dict(iter(g))

In [13]: d
Out[13]:
{'AAA':    SRC  Data1  Data2
 0  AAA    180    122
 5  AAA    170    100, 'BBB':    SRC  Data1  Data2
 1  BBB    168    121, 'CCC':    SRC  Data1  Data2
 2  CCC    165    147
 6  CCC    166    112, 'DDD':    SRC  Data1  Data2
 3  DDD    140    156
 7  DDD    116    155, 'EEE':    SRC  Data1  Data2
 4  EEE    152    103
 8  EEE    179    119}

In [14]: d["AAA"]
Out[14]:
   SRC  Data1  Data2
0  AAA    180    122
5  AAA    170    100

You can pull out the subgroups without copying:

您可以在不复制的情况下拉出子组：

In [21]: g.get_group("AAA")
Out[21]:
   SRC  Data1  Data2
0  AAA    180    122
5  AAA    170    100

Note: you can get an iterable of the keys with g.groups.keys().

注意：您可以使用g.groups.keys().

Answer 2

回答by MaxU

I'd generate a dictionary of DFs:

我会生成一个 DF 字典：

In [247]: dfs = {n:g for n,g in df.groupby('SRC')}

In [248]: dfs['AAA']
Out[248]:
   SRC  Data1  Data2
0  AAA    180    122
5  AAA    170    100

In [249]: dfs['BBB']
Out[249]:
   SRC  Data1  Data2
1  BBB    168    121

In [253]: dfs.keys()
Out[253]: dict_keys(['EEE', 'DDD', 'CCC', 'BBB', 'AAA'])

a bit nicer way to achieve the same thing:

实现相同目标的更好方法：

dfs = dict(tuple(df.groupby('SRC')))

使用 Pandas 中列的唯一值创建 DataFrame

提问by Felipe Amaral Rodrigues

回答by Andy Hayden

回答by MaxU

相关推荐

最近更新

标签

使用 Pandas 中列的唯一值创建 DataFrame

提问by Felipe Amaral Rodrigues

回答by Andy Hayden

回答by MaxU

相关推荐

Pandas：数据帧错误 - 2 列通过，传递的数据有 3 列

Pandas - 按 id 分组并使用阈值删除重复项

将 Pandas 数据帧作为压缩 CSV 直接写入 Amazon s3 存储桶？

如何使用正则表达式在 Pandas 中将一列拆分为多列？

相关推荐

最近更新

标签