pandas 级别 NaN 必须与名称相同

Question

提问by Ian Dzindo

I am trying to count how many times NaN appears in a column of a dataframe using this code:

我正在尝试使用以下代码计算 NaN 在数据帧的列中出现的次数：

count = enron_df.loc['salary'].count('NaN')

But every time i run this i get the following error:

但是每次我运行它时，我都会收到以下错误：

KeyError: 'Level NaN must be same as name (None)'

I searched around the web a lot trying to find a solution, but to no avail.

我在网上搜索了很多试图找到解决方案，但无济于事。

Answer 1

回答by jezrael

If NaNs are missing values:

如果NaNs 是缺失值：

enron_df = pd.DataFrame({'salary':[np.nan, np.nan, 1, 5, 7]})
print (enron_df)
   salary
0     NaN
1     NaN
2     1.0
3     5.0
4     7.0

count = enron_df['salary'].isna().sum()
#alternative
#count = enron_df['salary'].isnull().sum()
print (count)
2

If NaNs are strings:

如果NaNs 是strings：

enron_df = pd.DataFrame({'salary':['NaN', 'NaN', 1, 5, 'NaN']})
print (enron_df)
  salary
0    NaN
1    NaN
2      1
3      5
4    NaN

count = enron_df['salary'].eq('NaN').sum()
#alternative
#count = (enron_df['salary'] == 'NaN').sum()
print (count)
3

Answer 2

回答by rafaelc

By definition, countomits NaNs and sizedoes not.

根据定义，count省略NaNs 和size不省略。

Thus, a simple difference should do

因此，一个简单的区别应该做

count = enron_df['salary'].size - enron_df['salary'].count()

Answer 3

回答by zipa

Try like this:

像这样尝试：

count = df.loc[df['salary']=='NaN'].shape[0]

Or maybe better:

或者也许更好：

count = df.loc[df['salary']=='NaN', 'salary'].size

And, going down your path, you'd need something like this:

而且，沿着你的道路，你需要这样的东西：

count = df.loc[:, 'salary'].str.count('NaN').sum()

Answer 4

回答by ALollz

There's also value counts with the dropnaargument

dropna参数也有值计数

import numpy as np
import pandas as pd

enron_df = pd.DataFrame({'salary':[np.nan, np.nan, 1, 5, 7]})

enron_df.salary.value_counts(dropna=False)
#NaN     2
# 7.0    1
# 5.0    1
# 1.0    1
#Name: salary, dtype: int64

And if you just want the number, just select np.NaNfrom value counts. (If they are strings 'NaN', then just replace np.NaNwith 'NaN')

如果您只想要数字，只需np.NaN从值计数中进行选择。（如果它们是 strings 'NaN'，那么只需替换np.NaN为'NaN'）

enron_df.salary.value_counts(dropna=False)[np.NaN]
#2

pandas 级别 NaN 必须与名称相同

提问by Ian Dzindo

回答by jezrael

回答by rafaelc

回答by zipa

回答by ALollz

相关推荐

最近更新

标签

pandas 级别 NaN 必须与名称相同

提问by Ian Dzindo

回答by jezrael

回答by rafaelc

回答by zipa

回答by ALollz

相关推荐

Pandas 解析 csv 错误 - 预期找到 1 个字段 9

pandas 在 Python 中使用 geopy 进行地理编码时出现错误 (429) 请求过多

标准化 Python Pandas 数据框中的某些列？

Pandas 的 concat 函数中的“级别”、“键”和名称参数是什么？

相关推荐

最近更新

标签