Python 将 NumPy 数组与 Pandas DataFrame 连接（加入）

Question

提问by Jamgreen

I have a pandas dataframe with 10 rows and 5 columns and a numpy matrix of zeros np.zeros((10,3)).

我有一个有 10 行和 5 列的 Pandas 数据框和一个 numpy 零矩阵np.zeros((10,3))。

I want to concat the numpy matrix to the pandas dataframe but I want to delete the last column from the pandas dataframe before concatenating the numpy array to it.

我想将 numpy 矩阵连接到 Pandas 数据帧，但我想在将 numpy 数组连接到它之前从 Pandas 数据帧中删除最后一列。

So I will end up with a matrix of 10 rows and 5 - 1 + 3 = 7 columns.

所以我最终会得到一个 10 行和 5 - 1 + 3 = 7 列的矩阵。

I guess I could use

我想我可以用

new_dataframe = pd.concat([
    original_dataframe,
    pd.DataFrame(np.zeros((10, 3)), dtype=np.int)
], axis=1, ignore_index=True)

where original_dataframehas 10 rows and 5 columns.

其中original_dataframe有 10 行和 5 列。

How do I delete the last column from original_dataframebefore concatenating the numpy array? And how do I make sure I preserve all the data types?

如何original_dataframe在连接 numpy 数组之前删除最后一列？以及如何确保保留所有数据类型？

Answer 1

回答by cs95

Setup

设置

np.random.seed(0)
df = pd.DataFrame(np.random.choice(10, (3, 3)), columns=list('ABC'))
df

   A  B  C
0  5  0  3
1  3  7  9
2  3  5  2

`np.column_stack`/ `stack(axis=1)`/ `hstack`

`np.column_stack`/ `stack(axis=1)`/`hstack`

pd.DataFrame(pd.np.column_stack([df, np.zeros((df.shape[0], 3), dtype=int)]))

   0  1  2  3  4  5
0  5  0  3  0  0  0
1  3  7  9  0  0  0
2  3  5  2  0  0  0

Useful (and performant), but does not retain the column names from df. If you really want to slice out the last column, use ilocand slice it out:

有用（和高性能），但不保留df. 如果您真的想切出最后一列，请使用iloc并将其切出：

pd.DataFrame(pd.np.column_stack([
    df.iloc[:, :-1], np.zeros((df.shape[0], 3), dtype=int)]))

   0  1  2  3  4
0  5  0  0  0  0
1  3  7  0  0  0
2  3  5  0  0  0

`pd.concat`

You will need to convert the array to a DataFrame.

您需要将数组转换为 DataFrame。

df2 = pd.DataFrame(np.zeros((df.shape[0], 3), dtype=int), columns=list('DEF'))
pd.concat([df, df2], axis=1)

   A  B  C  D  E  F
0  5  0  3  0  0  0
1  3  7  9  0  0  0
2  3  5  2  0  0  0

`DataFrame.assign`

If it's only adding constant values, you can use assign:

如果它只是添加常量值，则可以使用assign：

df.assign(**dict.fromkeys(list('DEF'), 0))

   A  B  C  D  E  F
0  5  0  3  0  0  0
1  3  7  9  0  0  0
2  3  5  2  0  0  0

Python 将 NumPy 数组与 Pandas DataFrame 连接（加入）

提问by Jamgreen

回答by cs95

`np.column_stack`/ `stack(axis=1)`/ `hstack`

`np.column_stack`/ `stack(axis=1)`/`hstack`

`pd.concat`

`pd.concat`

`DataFrame.assign`

`DataFrame.assign`

相关推荐

最近更新

标签

Python 将 NumPy 数组与 Pandas DataFrame 连接（加入）

提问by Jamgreen

回答by cs95

np.column_stack/ stack(axis=1)/ hstack

np.column_stack/ stack(axis=1)/hstack

pd.concat

pd.concat

DataFrame.assign

DataFrame.assign

相关推荐

Python在索引后找到第一次出现的字符

Python 在 Jupyter Notebook 中使用 matplotlib 绘制动态变化的图形

Python 3.6 中的 f 字符串

Python 熊猫：从时间戳中提取日期和时间

相关推荐

最近更新

标签

`np.column_stack`/ `stack(axis=1)`/ `hstack`

`np.column_stack`/ `stack(axis=1)`/`hstack`

`pd.concat`

`pd.concat`

`DataFrame.assign`

`DataFrame.assign`