什么是 Pandas 中 dataframe.loc() 的 Numpy 等价物

Question

提问by Chris

I have a 120,000*4 numpy array as shown below. Each row is a sample. The first column is time in second, or the indexusing Pandas terminology.

我有一个 120,000*4 的 numpy 数组，如下所示。每一行都是一个样本。第一列是以秒为单位的时间，或index使用 Pandas 术语。

0.014      14.175  -29.97  -22.68 
0.022      13.905  -29.835 -22.68
0.030      12.257  -29.32  -22.67
... ...
1259.980   -0.405   2.205   3.825
1259.991   -0.495   2.115   3.735

I want to select the rows recorded between 100.000 to 200.000 sec and save it into a new array. If this were a Pandas dataframe, I would simply write df.loc[100:200]. What is the equivalent operation in numpy?

我想选择记录在 100.000 到 200.000 秒之间的行并将其保存到一个新数组中。如果这是 Pandas 数据框，我会简单地写df.loc[100:200]. numpy 中的等效操作是什么？

This is NOT a question of feasibility. I simply wonder if there are any pythonic one-line solutions.

这不是可行性问题。我只是想知道是否有任何 pythonic 单行解决方案。

Answer 1

采纳答案by rafaelc

This assumes indexes are sorted:

这假设索引已排序：

IIUC,

国际大学联盟，

x=np.array([ [1,2,3,4],
           [5,6,7,8],
           [9,10,11,12],
           [13,14,15,16]])

x[(x[:,0] >= 5) & (x[:,0] <= 9) ]

So you would have 100 and 200 instead of 5 and 9.

所以你会有 100 和 200 而不是 5 和 9。

For a more general solution, check Wen`s answer

如需更通用的解决方案，请查看Wen 的回答

Answer 2

回答by YOBEN_S

Data from Raf

来自 Raf 的数据

x[np.where(x[:,0]==5)[0][0]:np.where(x[:,0]==9)[0][0]+1,:]
Out[341]: 
array([[ 5,  6,  7,  8],
       [ 9, 10, 11, 12]])

Notice

注意

only using greater and less than for that can not fully replace the .loc, the back end of .loc is index position not value range

只用大于和小于不能完全替代.loc，.loc的后端是索引位置而不是取值范围

For example

例如

df
Out[348]: 
       0   1   2   3
0      1   2   3   4
1      5   6   7   8
4444   9  10  11  12
3     13  14  15  16

df.loc[1:3]
Out[347]: 
       0   1   2   3
1      5   6   7   8
4444   9  10  11  12
3     13  14  15  16

什么是 Pandas 中 dataframe.loc() 的 Numpy 等价物

提问by Chris

采纳答案by rafaelc

回答by YOBEN_S

相关推荐

最近更新

标签

什么是 Pandas 中 dataframe.loc() 的 Numpy 等价物

提问by Chris

采纳答案by rafaelc

回答by YOBEN_S

相关推荐

pandas Python - 'TypeError: '<=' 在 'str' 和 'int' 的实例之间不受支持

pandas 使用pandas将数据框导出到python中的csv文件

ModuleNotFoundError：没有名为“pandas.core.indexes”的模块

使用 Pandas，我如何根据第一个空间进行拆分。

相关推荐

最近更新

标签