pandas: How to add k-means predicted clusters as a column to a dataframe in Python

Question

Asked by Keithx

I have a question about kmeans clustering in python.

I ran the analysis like this:

from sklearn.cluster import KMeans

km = KMeans(n_clusters=12, random_state=1)
new = data._get_numeric_data().dropna(axis=1)
km.fit(new)
predict=km.predict(new)

How can I add the column with cluster results to my first dataframe "data" as an additional column? Thanks!

Answer 1

Answered by Gal Dreiman

Assuming the array of predictions has the same length as the columns of your dataframe df (data in the question), all you need to do is this:

df['NEW_COLUMN'] = pd.Series(predict, index=df.index)
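For reference, here is a minimal end-to-end sketch of the same idea. The toy dataframe, its column names, the "cluster" column name, and n_clusters=2 are made up for illustration; select_dtypes is used in place of the private _get_numeric_data call from the question. Because dropna(axis=1) removes columns rather than rows, the predictions should line up one-to-one with the rows of the original dataframe.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans

# Hypothetical toy data standing in for the asker's `data`.
data = pd.DataFrame({
    "name": ["a", "b", "c", "d", "e", "f"],   # non-numeric, ignored for clustering
    "x": [1.0, 1.1, 0.9, 8.0, 8.1, 7.9],
    "y": [1.0, 0.9, 1.1, 8.0, 7.9, 8.1],
    "z": [np.nan, 2.0, 3.0, 4.0, 5.0, 6.0],   # contains NaN, dropped by dropna(axis=1)
})

# Keep numeric columns only, then drop any column that has missing values.
new = data.select_dtypes(include="number").dropna(axis=1)

# n_clusters=2 is an assumption that suits this toy data (the question used 12).
km = KMeans(n_clusters=2, random_state=1, n_init=10)
predict = km.fit_predict(new)

# Attach the labels, aligned on the index of the rows that were clustered.
data["cluster"] = pd.Series(predict, index=new.index)
print(data)
```

Passing index=new.index means that if rows had been dropped instead (for example with dropna(axis=0)), the unclustered rows would simply get NaN in the new column rather than raising a length mismatch.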
