Pandas 中的多个同名列

Question

提问by vks

I am creating a dataframefrom a CSV file. I have gone through the docs, multiple SO posts, links as I have just started Pandas but didn't get it. The CSV file has multiple columns with same names say a.

我正在dataframe从 CSV 文件创建一个。我已经浏览了文档、多个 SO 帖子、链接，因为我刚刚开始使用 Pandas 但没有得到它。CSV 文件有多个具有相同名称的列，例如a.

So after forming dataframeand when I do df['a']which value will it return? It does not return all values.

那么在形成之后dataframe以及当我执行df['a']哪个值时它会返回？它不会返回所有值。

Also only one of the values will have a string rest will be None. How can I get that column?

此外，只有一个值将具有字符串 rest 将是None. 我怎样才能得到那个专栏？

Answer 1

回答by piRSquared

the relevant parameter is mangle_dupe_cols

相关参数是 mangle_dupe_cols

from the docs

从文档

mangle_dupe_cols : boolean, default True
    Duplicate columns will be specified as 'X.0'...'X.N', rather than 'X'...'X'

mangle_dupe_cols : boolean, default True
    Duplicate columns will be specified as 'X.0'...'X.N', rather than 'X'...'X'

by default, all of your 'a'columns get named 'a.0'...'a.N'as specified above.

默认情况下，您的所有'a'列都'a.0'...'a.N'按照上面指定的方式命名。

if you used mangle_dupe_cols=False, importing this csvwould produce an error.

如果您使用mangle_dupe_cols=False，导入它csv会产生错误。

you can get all of your columns with

你可以得到你所有的列

df.filter(like='a')

demonstration

示范

from StringIO import StringIO
import pandas as pd

txt = """a, a, a, b, c, d
1, 2, 3, 4, 5, 6
7, 8, 9, 10, 11, 12"""

df = pd.read_csv(StringIO(txt), skipinitialspace=True)
df

df.filter(like='a')

Pandas 中的多个同名列

提问by vks

回答by piRSquared

相关推荐

最近更新

标签

Pandas 中的多个同名列

提问by vks

回答by piRSquared

相关推荐

pandas 大熊猫数据帧上的外部合并导致 MemoryError ---如何与大熊猫合并“大数据”？

pandas 带删除的熊猫随机样本

将不同目录中的多个 .csv 文件读入 Pandas DataFrame

pandas Python 3 statsmodels Logit ValueError：在进入 DLASCL 参数编号 5 时有一个非法值

相关推荐

最近更新

标签