Python 查找列中的唯一值，然后对它们进行排序

Question

提问by MAS

I have a pandas dataframe. I want to print the unique values of one of its columns in ascending order. This is how I am doing it:

我有一个熊猫数据框。我想按升序打印其中一列的唯一值。这就是我的做法：

import pandas as pd
df = pd.DataFrame({'A':[1,1,3,2,6,2,8]})
a = df['A'].unique()
print a.sort()

The problem is that I am getting a Nonefor the output.

问题是我得到了一个None输出。

Answer 1

采纳答案by Vineet Kumar Doshi

sortedreturn a new sorted list from the items in iterable.

CODE

sorted从 iterable 中的项目返回一个新的排序列表。

代码

import pandas as pd
df = pd.DataFrame({'A':[1,1,3,2,6,2,8]})
a = df['A'].unique()
print sorted(a)

OUTPUT

输出

[1, 2, 3, 6, 8]

Answer 2

回答by EdChum

sortsorts inplace so returns nothing:

sort就地排序，因此不返回任何内容：

In [54]:
df = pd.DataFrame({'A':[1,1,3,2,6,2,8]})
a = df['A'].unique()
a.sort()
a

Out[54]:
array([1, 2, 3, 6, 8], dtype=int64)

So you have to call print aagain after the call to sort.

因此，您必须在调用print a之后再次调用sort。

Eg.:

例如。：

In [55]:
df = pd.DataFrame({'A':[1,1,3,2,6,2,8]})
a = df['A'].unique()
a.sort()
print(a)

[1 2 3 6 8]

Answer 3

回答by Challensois

I would suggest using numpy's sort, as it is anyway what pandas is doing in background:

我建议使用 numpy 的排序，因为无论如何熊猫在后台做什么：

import numpy as np
np.sort(df.A.unique())

But doing all in pandas is valid as well.

但是在 Pandas 中做所有事情也是有效的。

Answer 4

回答by Meloun

You can also use the drop_duplicates()instead of unique()

您还可以使用drop_duplicates()而不是 unique()

df = pd.DataFrame({'A':[1,1,3,2,6,2,8]})
a = df['A'].drop_duplicates()
a.sort()
print a

Answer 5

回答by MDMoore313

I prefer the oneliner:

我更喜欢oneliner：

print(sorted(df['Column Name'].unique()))

Answer 6

回答by Bowen Liu

Came across the question myself today. I think the reason that your code returns 'None' (exactly what I got by using the same method) is that

今天自己遇到了这个问题。我认为您的代码返回“无”的原因（正是我使用相同方法得到的）是

a.sort()

is calling the sort function to mutate the list a. In my understanding, this is a modification command. To see the result you have to use print(a).

正在调用排序函数来改变列表 a。在我的理解中，这是一个修改命令。要查看结果，您必须使用 print(a)。

My solution, as I tried to keep everything in pandas:

我的解决方案，因为我试图将所有内容都保存在熊猫中：

pd.Series(df['A'].unique()).sort_values()

Answer 7

回答by Ivan Carrasco Quiroz

Another way is using setdata type.

另一种方法是使用set数据类型。

Some characteristic of Sets:Sets are unordered, can include mixed data types, elements in a set cannot be repeated, are mutable.

集合的一些特性：集合是无序的，可以包含混合数据类型，集合中的元素不能重复，是可变的。

Solving your question:

解决您的问题：

df = pd.DataFrame({'A':[1,1,3,2,6,2,8]})
sorted(set(df.A))

The answer in Listtype:

列表类型的答案：

[1, 2, 3, 6, 8]

Python 查找列中的唯一值，然后对它们进行排序

提问by MAS

采纳答案by Vineet Kumar Doshi

回答by EdChum

回答by Challensois

回答by Meloun

回答by MDMoore313

回答by Bowen Liu

回答by Ivan Carrasco Quiroz

相关推荐

最近更新

标签

Python 查找列中的唯一值，然后对它们进行排序

提问by MAS

采纳答案by Vineet Kumar Doshi

回答by EdChum

回答by Challensois

回答by Meloun

回答by MDMoore313

回答by Bowen Liu

回答by Ivan Carrasco Quiroz

相关推荐

为什么使用 from __future__ import print_function 会破坏 Python2 风格的打印？

Python 如何在 rcParams 中使用 linestyle=None 在 matplotlib 中制作误差条图？

Python NumPy 的 transpose() 方法如何置换数组的轴？

Python 如何优雅地处理 SIGTERM 信号？

相关推荐

最近更新

标签

为什么使用 from future import print_function 会破坏 Python2 风格的打印？