Python Pandas 'DataFrame' 对象没有属性 'unique'

Question

提问by thesebeth

I'm working in pandas doing pivot tables and when doing the groupby (to count distinct observations) aggfunc={"person":{lambda x: len(x.unique())}}gives me the following error: 'DataFrame' object has no attribute 'unique'any ideas how to fix it?

我正在使用 Pandas 做数据透视表，并且在执行 groupby（计算不同的观察值）时 aggfunc={"person":{lambda x: len(x.unique())}}出现以下错误： 'DataFrame' object has no attribute 'unique'任何想法如何解决它？

Answer 1

回答by Alexander

DataFrames do not have that method; columns in DataFrames do:

DataFrames 没有那个方法；DataFrame 中的列执行以下操作：

df['A'].unique()

Or, to get the names with the number of observations (using the DataFrame given by closedloop):

或者，获取具有观察次数的名称（使用闭环给出的数据帧）：

>>> df.groupby('person').person.count()
Out[80]: 
person
0         2
1         3
Name: person, dtype: int64

Answer 2

回答by closedloop

Rather than removing duplicates during the pivot table process, use the df.drop_duplicates()function to selectively drop duplicates.

与其在数据透视表过程中删除重复项，不如使用该df.drop_duplicates()函数有选择地删除重复项。

For example if you are pivoting using these index='c0'and columns='c1'then this simple step yields the correct counts.

例如，如果您使用这些进行旋转index='c0'，columns='c1'那么这个简单的步骤会产生正确的计数。

In this example the 5th row is a duplicate of the 4th (ignoring the non-pivoted c2column

在此示例中，第 5 行是第 4 行的副本（忽略非透视c2列

import pandas as pd
data = {'c0':[0,1,0,1,1], 'c1':[0,0,1,1,1], 'person':[0,0,1,1,1], 'c_other':[1,2,3,4,5]}
df = pd.DataFrame(data)
df2 = df.drop_duplicates(subset=['c0','c1','person'])
pd.pivot_table(df2, index='c0',columns='c1',values='person', aggfunc='count')

This correctly outputs

这正确输出

Python Pandas 'DataFrame' 对象没有属性 'unique'

提问by thesebeth

回答by Alexander

回答by closedloop

相关推荐

最近更新

标签

Python Pandas 'DataFrame' 对象没有属性 'unique'

提问by thesebeth

回答by Alexander

回答by closedloop

相关推荐

Python 切片字典

Python Django 查询集过滤器 GT、LT、GTE、LTE 返回完整的对象列表

Python 套接字错误：地址已被使用

ValueError：解包的值太多（Python 2.7）

相关推荐

最近更新

标签