pandas 从熊猫中的单个字符串列创建新的二进制列

Question

提问by user1610719

I've seen this before and simply can't remember the function.

我以前见过这个，只是不记得这个功能。

Say I have a column "Speed" and each row has 1 of these values:

假设我有一列“速度”，每一行都有以下值之一：

'Slow', 'Normal', 'Fast'

How do I create a new dataframe with all my rows except the column "Speed" which is now 3 columns: "Slow" "Normal" and "Fast" which has all of my rows labeled with a 1 in whichever column the old "Speed" column was. So if I had:

如何使用除“速度”列之外的所有行创建新数据框，该列现在有 3 列：“慢”、“正常”和“快”，其中我的所有行在旧的“速度”列中都标有 1 " 列了。所以如果我有：

print df['Speed'].ix[0]
> 'Normal'

I would not expect this:

我不希望这样：

print df['Normal'].ix[0]
>1

print df['Slow'].ix[0]
>0

Answer 1

回答by joris

You can do this easily with pd.get_dummies(docs):

您可以使用pd.get_dummies( docs)轻松完成此操作：

In [37]: df = pd.DataFrame(['Slow', 'Normal', 'Fast', 'Slow'], columns=['Speed'])

In [38]: df
Out[38]:
    Speed
0    Slow
1  Normal
2    Fast
3    Slow

In [39]: pd.get_dummies(df['Speed'])
Out[39]:
   Fast  Normal  Slow
0     0       0     1
1     0       1     0
2     1       0     0
3     0       0     1

Answer 2

回答by aha

Here is one solution:

这是一种解决方案：

df['Normal'] = df.Speed.apply(lambda x: 1 if x == "Normal" else 0)
df['Slow'] = df.Speed.apply(lambda x: 1 if x == "Slow" else 0)
df['Fast'] = df.Speed.apply(lambda x: 1 if x == "Fast" else 0)

Answer 3

回答by sun

This has another method：

这还有一个方法：

df           = pd.DataFrame(['Slow','Fast','Normal','Normal'],columns=['Speed'])
df['Normal'] = np.where(df['Speed'] == 'Normal', 1 ,0)
df['Fast']   = np.where(df['Speed'] == 'Fast', 1 ,0)
df['Slow']   = np.where(df['Speed'] == 'Slow', 1 ,0)

df 
     Speed  Normal  Fast  Slow
0    Slow       0     0     1
1    Fast       0     1     0
2  Normal       1     0     0
3  Normal       1     0     1

pandas 从熊猫中的单个字符串列创建新的二进制列

提问by user1610719

回答by joris

回答by aha

回答by sun

相关推荐

最近更新

标签

pandas 从熊猫中的单个字符串列创建新的二进制列

提问by user1610719

回答by joris

回答by aha

回答by sun

相关推荐

pandas 如何在pandas groupby中聚合多列

将文本添加到 Pandas 数据框图

pandas 基于正则表达式过滤数据框

根据 Pandas 中的组大小对分组数据进行排序

相关推荐

最近更新

标签