Python: creating a new column in pandas using a lambda function on two existing columns

Question

Asked by piyush sharma

I am able to add a new column in pandas by defining a function and then using apply. However, I want to do this with a lambda; is there a way to do that?

For example, df has two columns, a and b. I want to create a new column c that holds, for each row, the larger of the string lengths in a and b.

Something like:

df['c'] = df.apply(lambda x, len(df['a']) if len(df['a']) > len(df['b']) or len(df['b']) )

One approach I tried, which returns NaN for every row:

df = pd.DataFrame({'a':['dfg','f','fff','fgrf','fghj'], 'b' : ['sd','dfg','edr','df','fghjky']})

df['c'] = df.apply(lambda x: max([len(x) for x in [df['a'], df['b']]]))
print(df)
      a       b   c
0   dfg      sd NaN
1     f     dfg NaN
2   fff     edr NaN
3  fgrf      df NaN
4  fghj  fghjky NaN

Answer 1

Accepted answer by jezrael

You can use the map function to compute the string lengths and np.where to select between them:

import numpy as np

print(df)
#     a     b
#0  aaa  rrrr
#1   bb     k
#2  ccc     e
# np.where(condition, x, y): where the condition is True take len of column a, else len of column b
df['c'] = np.where(df['a'].map(len) > df['b'].map(len), df['a'].map(len), df['b'].map(len))
print(df)
#     a     b  c
#0  aaa  rrrr  4
#1   bb     k  2
#2  ccc     e  3
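As a self-contained sketch of the same np.where approach, here it is applied to the sample data from the question. The intermediate names len_a and len_b are introduced only for illustration, so that each length is computed once.

```python
import numpy as np
import pandas as pd

# Sample data taken from the question
df = pd.DataFrame({
    'a': ['dfg', 'f', 'fff', 'fgrf', 'fghj'],
    'b': ['sd', 'dfg', 'edr', 'df', 'fghjky'],
})

# Element-wise string lengths of each column
len_a = df['a'].map(len)
len_b = df['b'].map(len)

# Pick len_a where it is larger, otherwise len_b
df['c'] = np.where(len_a > len_b, len_a, len_b)
print(df)
```

Because np.where works on whole columns at once, it does not need a per-row lambda.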

The next solution uses apply with the parameter axis=1:

axis = 1 or 'columns': apply function to each row

df['c'] = df.apply(lambda x: max(len(x['a']), len(x['b'])), axis=1)
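A minimal runnable sketch of the apply approach, using the small frame from this answer. It also shows, for contrast, what the lambda receives when axis=1 is left out, which is likely why the attempt in the question did not produce per-row lengths. The names row, col and per_column are illustrative only.

```python
import pandas as pd

df = pd.DataFrame({'a': ['aaa', 'bb', 'ccc'], 'b': ['rrrr', 'k', 'e']})

# With axis=1 the lambda receives one row at a time as a Series,
# so row['a'] and row['b'] are single strings.
df['c'] = df.apply(lambda row: max(len(row['a']), len(row['b'])), axis=1)

# With the default axis=0 the lambda receives whole columns,
# so len() returns the number of rows rather than a string length.
per_column = df[['a', 'b']].apply(lambda col: len(col))

print(df['c'].tolist())
print(per_column.tolist())
```

The default-axis result is indexed by column name ('a', 'b'), not by row label, so assigning it to a new column cannot line up with the rows.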
