在 Pandas 数据帧上使用 .replace() 方法时字典中的重叠键

Question

提问by Nirvan

I want to replace some values in a column of a dataframe using a dictionary that maps the old codes to the new codes.

我想使用将旧代码映射到新代码的字典替换数据帧列中的某些值。

di = dict( { "myVar": {11:0, 204:11} } )
mydata.replace( to_replace = di, inplace = True )

But some of the new codes and old codes overlap. When using the .replace method of the dataframe I encounter the error 'Replacement not allowed with overlapping keys and values'

但是一些新代码和旧代码重叠。使用数据框的 .replace 方法时遇到错误'Replacement not allowed with overlapping keys and values'

My current workaround is to replace replace the offending keys manually and then apply the dictionary to the remaining non-overlapping cases.

我目前的解决方法是手动替换替换有问题的键，然后将字典应用于剩余的非重叠案例。

mydata.loc[ mydata.myVar == 11, "myVar" ] = 0 
di = dict( { "myVar": {204:11} } )
mydata.replace( to_replace = di, inplace = True )

Is there a more compact way to do this?

有没有更紧凑的方法来做到这一点？

Answer 1

回答by Nirvan

I found an answer herethat uses the .map method on a series in conjunction with a dictionary. Here's an example recoding dictionary with overlapping keys and values.

我在这里找到了一个答案，该答案将 .map 方法与字典结合使用。这是一个具有重叠键和值的重新编码字典示例。

import pandas as pd
>>> df = pd.DataFrame( [1,2,3,4,1], columns = ['Var'] )
>>> df
   Var
0    1
1    2
2    3
3    4
4    1
>>> dict = {1:2, 2:3, 3:1, 4:3}
>>> df.Var.map( dict )
0    2
1    3
2    1
3    3
4    2
Name: Var, dtype: int64

UPDATE:

更新：

With map, every value in the original series must be mapped to a new value. If the mapping dictionary does not contain all the values of the original column, the unmapped values are mapped to NaN.

使用 map，原始系列中的每个值都必须映射到一个新值。如果映射字典不包含原始列的所有值，则未映射的值将映射到 NaN。

>>> df = pd.DataFrame( [1,2,3,4,1], columns = ['Var'] )
>>> dict = {1:2, 2:3, 3:1}
>>> df.Var.map( dict )
0    2.0
1    3.0
2    1.0
3    NaN
4    2.0
Name: Var, dtype: float64

在 Pandas 数据帧上使用 .replace() 方法时字典中的重叠键

提问by Nirvan

回答by Nirvan

相关推荐

最近更新

标签

在 Pandas 数据帧上使用 .replace() 方法时字典中的重叠键

提问by Nirvan

回答by Nirvan

相关推荐

pandas 如何在python中为绘图添加填充？

pandas 怎么把日期改成那个月的第一个日期？

pandas 如何从数据框中弹出行？

如何找到所有（）正则表达式序列到 Pandas 数据帧？

相关推荐

最近更新

标签