Python Pandas read_table: using the first column as the index

Question

Asked by fricadelle

I have a small problem. I have a txt file whose lines have the following form (line 1, for example):

id1-a1-b1-c1

I want to load it into a DataFrame using pandas, with the ids as the index, the column names 'A', 'B', 'C', and the corresponding ai, bi, ci as the values.

In the end I want the DataFrame to look like:

    'A'   'B'  'C'
id1  a1    b1   c1
id2  a2    b2   c2
...   ...   ...  ...

I may want to read the file in chunks if it is large, but let's assume I read it all at once:

import pandas as pd

with open('file.txt') as f:
    table = pd.read_table(f, sep='-', index_col=0, header=None, lineterminator='\n')

and then rename the columns:

table.columns = ['A','B','C']

My current output looks something like:

    'A'   'B'  'C'
0
id1  a1    b1   c1
id2  a2    b2   c2
...   ...   ...  ...

There is an extra row (the one containing 0) that I can't explain.

Thanks

EDIT

When I add the argument

chunksize=20

to the read_table call and then run:

for chunk in table:
    print(chunk)

I get the following error:

pandas.parser.CParserError: Error tokenizing data. C error: Calling read(nbytes) on source failed. Try engine='python'.
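A note on the chunked read: with chunksize set, read_table returns a lazy reader rather than a DataFrame, so the file is only read while the reader is being iterated. One likely cause of the error above (an assumption, since the full code is not shown) is that the loop runs after the with block has already closed the file. A minimal sketch that iterates while the file is still open follows; the sample data, the temporary file, the 'id' column name, and chunksize=2 are made up for illustration.

```python
import os
import tempfile

import pandas as pd

# Hypothetical sample data standing in for file.txt
tmp_dir = tempfile.mkdtemp()
path = os.path.join(tmp_dir, "file.txt")
with open(path, "w") as f:
    f.write("id1-a1-b1-c1\nid2-a2-b2-c2\nid3-a3-b3-c3\n")

chunks = []
with open(path) as f:
    # With chunksize, read_table returns an iterator of DataFrames
    reader = pd.read_table(
        f,
        sep="-",
        header=None,
        names=["id", "A", "B", "C"],  # 'id' is an illustrative name for the first field
        index_col="id",
        chunksize=2,
    )
    # Consume the reader inside the with block, while the file is still open
    for chunk in reader:
        chunks.append(chunk)

# Reassemble the pieces only if the whole table is needed at once
table = pd.concat(chunks)
print(table)
```

Passing the file path directly to read_table instead of an open handle is another option, since pandas then manages the file itself.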

Answer 1

Answered by Bryan

If you know the column names before the file is read, pass them as a list using the names parameter of read_table:

with open('file.txt') as f:
    table = pd.read_table(f, sep='-', index_col=0, header=None, names=['A','B','C'],
                          lineterminator='\n')

This outputs:

      A   B   C
id1  a1  b1  c1
id2  a2  b2  c2
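As for the unexplained row in the question: with header=None the columns are labelled 0, 1, 2, ..., and index_col=0 turns column 0 into the index, which keeps 0 as its name. That name is what gets printed on its own line above the ids; it is not a data row. A minimal sketch, using an in-memory string in place of file.txt (the sample data is made up), that shows the name and clears it:

```python
import io

import pandas as pd

# Hypothetical sample data standing in for file.txt
data = io.StringIO("id1-a1-b1-c1\nid2-a2-b2-c2\n")

table = pd.read_table(data, sep="-", index_col=0, header=None)
table.columns = ["A", "B", "C"]

# The index inherits the integer label of the column it was built from
name_before = table.index.name
print(repr(name_before))

# Dropping the index name removes the extra line from the printed output
table = table.rename_axis(None)
print(table)
```

Setting table.index.name = None has the same effect as rename_axis(None); which one to use is a matter of taste.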
