pandas 匹配列名时出现值错误

Question

提问by shantanuo

The following code shows an error. But it works if I remove usercols parameter.

下面的代码显示了一个错误。但是如果我删除 usercols 参数它会起作用。

from StringIO import StringIO
import pandas as pd

u_cols = ['page_id','web_id']
audit_trail = StringIO('''
page_id | web_id
3|0
7|3
11|4
15|5
19|6
''')

df = pd.read_csv(audit_trail, sep="|", usecols = u_cols  )

ValueError: Passed header names mismatches usecols

ValueError：传递的标头名称与 usecols 不匹配

I need to use u_cols list because the column headings are being generated dynamically.

我需要使用 u_cols 列表，因为列标题是动态生成的。

Answer 1

回答by shantanuo

"names" should be used instead of "usecolmns"

应该使用“名称”而不是“usecolmns”

from StringIO import StringIO
import pandas as pd

u_cols = ['page_id','web_id']
audit_trail = StringIO('''
page_id | web_id
3|0
7|3
11|4
15|5
19|6
''')

df11 = pd.read_csv(audit_trail, sep="|", names = u_cols  )

Answer 2

回答by ZJS

This is because of the white space next to the | seperator. When you run pd.read_csv(audit_trail,sep="|")you actually have the columns ['page_id(whitespace)','(whitespace)web_id'] instead of ['page_id','web_id'].

这是因为 | 旁边的空白区域分隔符。当您运行时，pd.read_csv(audit_trail,sep="|")您实际上拥有列 ['page_id(whitespace)','(whitespace)web_id'] 而不是 ['page_id','web_id']。

I would suggest passing the following regex pattern as your seperator \s*\|\s*, which will remove any whitespace around the | seperator. Here is the full solution...

我建议将以下正则表达式模式作为分隔符传递\s*\|\s*，这将删除 | 周围的任何空格。分隔符。这是完整的解决方案......

u_cols = ['page_id','web_id']

"""page_id | web_id
3|0
7|3
11|4
15|5
19|6"""

df = pd.read_csv(StringIO(s),sep="\s*\|\s*",usecols = u_cols)

output

输出

   page_id  web_id
0        3       0
1        7       3
2       11       4
3       15       5
4       19       6

pandas 匹配列名时出现值错误

提问by shantanuo

回答by shantanuo

回答by ZJS

相关推荐

最近更新

标签

pandas 匹配列名时出现值错误

提问by shantanuo

回答by shantanuo

回答by ZJS

相关推荐

pandas Python 熊猫：pd.options.display.mpl_style = 'default' 导致图形崩溃

pandas 使用 Seaborn FacetGrid 从数据框中绘制错误条

Python pandas：如何从数据帧的时间戳中获取小时？

pandas 熊猫只从数据框中选择数字或整数字段

相关推荐

最近更新

标签