Python Pandas 合并给出错误“缓冲区的维数错误（预期为 1，得到 2）”

Question

提问by lathomas64

I am trying to do a pandas merge and get the above error from the title when I try to run it. I am using 3 columns to match on whereas just before I do similar merge on only 2 columns and it works fine.

我正在尝试进行熊猫合并，并在尝试运行它时从标题中获取上述错误。我使用 3 列进行匹配，而就在我仅对 2 列进行类似合并之前，它工作正常。

df = pd.merge(df, c, how="left",
        left_on=["section_term_ps_id", "section_school_id", "state"],
        right_on=["term_ps_id", "term_school_id", "state"])

columns for the two dataframes

两个数据框的列

df:

df：

Index([u'section_ps_id', u'section_school_id', u'section_course_number', u'secti on_term_ps_id', u'section_staff_ps_id', u'section_number', u'section_expression' , u'section_grade_level', u'state', u'sections_id', u'course_ps_id', u'course_sc hool_id', u'course_number', u'course_schd_dept', u'courses_id', u'school_ps_id', u'course_school_id', u'school_name', u'school_abbr', u'school_low_grade', u'sch ool_high_grade', u'school_alt_school_number', u'school_state', u'school_phone', u'school_fax', u'school_principal', u'school_principal_phone', u'school_principa l_email', u'school_asst_principal', u'school_asst_principal_phone', u'school_ass t_principal_email'], dtype='object')

索引([u'section_ps_id', u'section_school_id', u'section_course_number', u'section_term_ps_id', u'section_staff_ps_id', u'section_number', u'section_expression' , u'section_grade_level', u'state', u 'sections_id', u'course_ps_id', u'course_sc hool_id', u'course_number', u'course_schd_dept', u'courses_id', u'school_ps_id', u'course_school_id', u'school_name', u'school_abbr', u'school_low_grade', u'sch ool_high_grade', u'school_alt_school_number', u'school_state', u'school_phone', u'school_fax', u'school_principal', u'school_principal_phone', u'school_principal_school_phone' '，你'school_asst_principal_phone', u'school_ass t_principal_email'], dtype='object')

c:

C：

Index([u'term_ps_id', u'term_school_id', u'term_portion', u'term_start_date', u' term_end_date', u'term_abbreviation', u'term_name', u'state', u'terms_id', u'sch ool_ps_id', u'term_school_id', u'school_name', u'school_abbr', u'school_low_grad e', u'school_high_grade', u'school_alt_school_number', u'school_state', u'school _phone', u'school_fax', u'school_principal', u'school_principal_phone', u'school _principal_email', u'school_asst_principal', u'school_asst_principal_phone', u's chool_asst_principal_email'], dtype='object')

索引([u'term_ps_id', u'term_school_id', u'term_portion', u'term_start_date', u' term_end_date', u'term_abbreviation', u'term_name', u'state', u'terms_id', u' sch ool_ps_id', u'term_school_id', u'school_name', u'school_abbr', u'school_low_grad e', u'school_high_grade', u'school_alt_school_number', u'school_state', u'school _phone', u'school_fax' , u'school_principal', u'school_principal_phone', u'school_principal_email', u'school_asst_principal', u'school_asst_principal_phone', u's cool_asst_principal_email'], dtype='object

Is it possible to merge on three columns like this? Is there anything wrong from the merge call here?

是否可以像这样合并三列？这里的合并调用有什么问题吗？

Answer 1

采纳答案by JD Long

As mentioned in the comments, you have a dupe column:

正如评论中提到的，你有一个欺骗列：

enter image description here

在此处输入图片说明

Answer 2

回答by 2Obe

To adress the issue of the dupe columns you can either drop the dupe column using duplicatedwith smth. like:

要解决dupe 列的问题，您可以使用duplicatedwith smth删除dupe 列。喜欢：

c = c[~c.columns.duplicated(keep='first')]

or adding an additional char to either one of the DataFrames using for example: c.columns=[c.columns[i]+str(i) for i in range(len(c.columns))]

或使用例如： c.columns=[c.columns[i]+str(i) for i in range(len(c.columns))]

Keep in mind that in this case you must adjust the merging part

请记住，在这种情况下，您必须调整合并部分

Answer 3

回答by Shivpe_R

This Will remove the duplicated columns from the Dataframe

这将从数据框中删除重复的列

df = df[list(df.columns[~df.columns.duplicated()])]

Python Pandas 合并给出错误“缓冲区的维数错误（预期为 1，得到 2）”

提问by lathomas64

采纳答案by JD Long

回答by 2Obe

回答by Shivpe_R

相关推荐

最近更新

标签

Python Pandas 合并给出错误“缓冲区的维数错误（预期为 1，得到 2）”

提问by lathomas64

采纳答案by JD Long

回答by 2Obe

回答by Shivpe_R

相关推荐

Python 请求：不要等待请求完成

Python：将 DataFrame 的每一行除以另一个 DataFrame 向量

Python3：ImportError：使用来自模块多处理的值时没有名为“_ctypes”的模块

Python 使用 matplotlib 将 y 范围更改为从 0 开始

相关推荐

最近更新

标签