如何在python中从第k列开始删除具有空值的行

Question

提问by user1140126

I need to remove all rows in which elements from column 3 onwards are all NaN

我需要删除所有从第 3 列开始的元素都是 NaN 的行

df = DataFrame(np.random.randn(6, 5), index=['a', 'c', 'e', 'f', 'g','h'], columns=['one', 'two', 'three', 'four', 'five'])

df2 = df.reindex(['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h'])
df2.ix[1][0] = 111
df2.ix[1][1] = 222

In the example above, my final data frame would not be having rows 'b' and 'c'.

在上面的例子中，我的最终数据框不会有“b”和“c”行。

How to use df.dropna()in this case?

df.dropna()在这种情况下如何使用？

Answer 1

采纳答案by Andy Hayden

You can call dropnawith arguments subsetand how:

您可以dropna使用参数subset和调用how：

df2.dropna(subset=['three', 'four', 'five'], how='all')

As the names suggests:

顾名思义：

how='all'requires every column (of subset) in the row to be NaNin order to be dropped, as opposed to the default 'any'.
subsetis those columns to inspect for NaNs.

how='all'要求行中的每一列 (of subset)NaN都被删除，而不是默认的'any'.
subset是要检查NaNs 的那些列。

As @PaulHpoints out, we can generalise to drop the last kcolumns with:

正如@PaulH指出的那样，我们可以概括为删除最后一k列：

subset=df2.columns[k:]

Indeed, we could even do something more complicated if desired:

事实上，如果需要，我们甚至可以做一些更复杂的事情：

subset=filter(lambda x: len(x) > 3, df2.columns)

如何在python中从第k列开始删除具有空值的行

提问by user1140126

采纳答案by Andy Hayden

相关推荐

最近更新

标签

如何在python中从第k列开始删除具有空值的行

提问by user1140126

采纳答案by Andy Hayden

相关推荐

Python 有没有办法让 Tkinter 文本小部件只读？

Python 糟糕的 Django / uwsgi 性能

Python 使 Tkinter 小部件成为焦点

Python 在字典中递归查找键

相关推荐

最近更新

标签