Pandas DataFrame apply() method provides a row object, but how do I access the index value?

Question

Asked by Paul H

I am new to pandas and DataFrames and have run into an issue. The DataFrame.apply() method passes a row parameter to the provided function. However, I can't work out how to get the index value corresponding to that row from the row parameter.

An example

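import numpy as np
from pandas import DataFrame

df = DataFrame({'a': np.random.randn(6),
                'b': ['foo', 'bar'] * 3,
                'c': np.random.randn(6)})

# 'a' becomes the index and is no longer a column
df = df.set_index('a')

def my_test2(row):
    # row['a'] fails here because 'a' is not among the row's labels
    return "{}.{}".format(row['a'], row['b'])

df['Value'] = df.apply(my_test2, axis=1)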

This raises a KeyError:

KeyError: ('a', u'occurred at index -1.16119852166')

The problem is that row['a'] in the my_test2 function fails. If I don't call df.set_index('a') it works fine, but I do want to have an index on a.

I tried duplicating column a (once as the index and once as a column), and this works, but it seems ugly and error-prone.
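
For reference, the duplication workaround described above can be sketched with the drop=False argument of set_index, which keeps a as a regular column while also using it as the index. The fixed values below are assumptions standing in for the random data so the result is reproducible; this illustrates the workaround, not a recommended fix.

```python
import pandas as pd

# Fixed values stand in for the random data in the question (assumption).
df = pd.DataFrame({'a': [0.5, 1.5], 'b': ['foo', 'bar']})

# drop=False keeps 'a' as a column in addition to making it the index.
df = df.set_index('a', drop=False)

def my_test2(row):
    # row['a'] works now because 'a' is still one of the row's labels.
    return "{}.{}".format(row['a'], row['b'])

df['Value'] = df.apply(my_test2, axis=1)
print(df['Value'].tolist())
```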

Any ideas on how to get the corresponding index value given the row object?

Many thanks in advance.

Answer 1

Answered by BKay

I believe what you want is this:

def my_test(row):
    # row.name is the index label of the row being processed
    return "{}.{}".format(row.name, row['b'])

This works because:

"{}.{}".format("ham", "cheese")

returns

'ham.cheese'

and if you reference a single row, its name attribute holds that row's index value. For the example above:

df.iloc[0]

returns

b                           foo
c                      1.417726
Value    0.7842562355491481.foo
Name: 0.784256235549, dtype: object

The Name field at the bottom is the row's index value, and df.iloc[0].name returns exactly that value.
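
A quick way to check this relationship is to compare a row's name with the corresponding entry of the DataFrame's index. This is a small sketch with made-up values (the index labels 0.25 and 0.75 are assumptions, not the data from the question):

```python
import pandas as pd

# Hypothetical two-row frame indexed by 'a'.
df = pd.DataFrame({'b': ['foo', 'bar'], 'c': [1.0, 2.0]},
                  index=pd.Index([0.25, 0.75], name='a'))

first = df.iloc[0]   # a Series holding the first row
print(first.name)    # the index label of that row
print(first.name == df.index[0])
```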

Therefore, for the ith row, this function is equivalent to looking up that row's index value and executing this command:

"{}.{}".format(df.iloc[i].name, df.iloc[i]['b'])

The apply function then does this for every row.
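
Putting the pieces together, here is a minimal self-contained sketch of the approach. Fixed values replace np.random.randn(6) (an assumption made here so the output is reproducible); everything else follows the question's setup.

```python
import pandas as pd

# Fixed values stand in for the random data so the result is reproducible.
df = pd.DataFrame({'a': [0.5, 1.5, 2.5, 3.5, 4.5, 5.5],
                   'b': ['foo', 'bar'] * 3,
                   'c': [10.0, 20.0, 30.0, 40.0, 50.0, 60.0]})
df = df.set_index('a')

def my_test(row):
    # With axis=1, each row arrives as a Series whose .name is its index label.
    return "{}.{}".format(row.name, row['b'])

df['Value'] = df.apply(my_test, axis=1)
print(df['Value'].tolist())
```

No duplicate column is needed: the index value travels with each row as its name.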
