Pandas 中的 Sumifs 有两个条件

Question

提问by Irwin Mooketsi Chelenyane

I want the pandas equivalent of the Excel's sumifsfor example

我想Pandas相当于Excel中的sumifs例如

=SUMIFS($D4:D7,$D7,$G4:G7)

I have three columns, the contract, the amountand transaction_type_tla. For each contract, I would like to sum the amountif the transaction type is CBP. The following formula is not working:

我有三列， the contract， theamount和transaction_type_tla。对于每个contract，我想总结amount交易类型是否为CBP. 以下公式不起作用：

data['Var']=(data.groupby('contract',"transaction_type_tla=='CBP'")['amount'].cumsum())

Answer 1

回答by YOBEN_S

Borrow jp'data :-)

借用 jp'data :-)

df['New']=df.groupby('contract').apply(lambda x : x['amount'][x['type']=='CBP'].cumsum()).reset_index(level=0,drop=True)
df
Out[258]: 
  contract  amount type    New
0        A     123  ABC    NaN
1        A     341  ABC    NaN
2        A     652  CBP  652.0
3        A     150  CBP  802.0
4        B     562  DEF    NaN
5        B     674  ABC    NaN
6        B     562  CBP  562.0
7        B     147  CBP  709.0

Answer 2

回答by ZaxR

Edit: I think @Wen's answer is more in line with what you're looking for, but in case you wanted the result as a series:

编辑：我认为@Wen 的答案更符合您的要求，但如果您希望将结果作为一个系列：

An easy way to do this is to first filter the list of transactions by the transaction_type_tla you're looking for and then apply the groupby and whatever aggregation method you want:

一种简单的方法是首先按您要查找的 transaction_type_tla 过滤交易列表，然后应用 groupby 和您想要的任何聚合方法：

ans = data[data['transaction_type_tla'] == 'CBP']
ans.groupby('contract')['amount'].cumsum()

This will result in a series with your answer.

这将导致您的答案系列。

Answer 3

回答by jpp

This is one way. I've set up some imaginary data to test.

这是一种方式。我已经设置了一些假想数据进行测试。

Output is dataframe in same format, but with CBPtransactions summed.

输出是相同格式的数据帧，但CBP总和交易。

import pandas as pd

df = pd.DataFrame([['A', 123, 'ABC'],
                   ['A', 341, 'ABC'],
                   ['A', 652, 'CBP'],
                   ['A', 150, 'CBP'],
                   ['B', 562, 'DEF'],
                   ['B', 674, 'ABC'],
                   ['B', 562, 'CBP'],
                   ['B', 147, 'CBP']],
                  columns=['contract', 'amount', 'type'])

s = df.groupby(['contract', 'type'])['amount'].sum()
df = df.set_index(['contract', 'type']).join(s, rsuffix='_group')

df.loc[pd.IndexSlice[:, 'CBP'], 'amount'] = df.loc[pd.IndexSlice[:, 'CBP'], 'amount_group']
df = df.drop('amount_group', 1).reset_index().drop_duplicates()

#   contract type  amount
# 0        A  ABC     123
# 1        A  ABC     341
# 2        A  CBP     802
# 4        B  ABC     674
# 5        B  CBP     709
# 7        B  DEF     562

Pandas 中的 Sumifs 有两个条件

提问by Irwin Mooketsi Chelenyane

回答by YOBEN_S

回答by ZaxR

回答by jpp

相关推荐

最近更新

标签

Pandas 中的 Sumifs 有两个条件

提问by Irwin Mooketsi Chelenyane

回答by YOBEN_S

回答by ZaxR

回答by jpp

相关推荐

pandas 在 Python 中使用 mca 包

pandas 熊猫数据框列表理解中的 If ElseIf Else 条件

两个 Pandas 数据框中的公共列列表

pandas 将索引号转换为 int (Python)

相关推荐

最近更新

标签