Python 使用 Pandas 读取带有前导空格的文本文件会给出 NaN 列

Question

提问by Caleb

I am using pandas.read_csv to read a whitespace delimited file. The file has a variable number of whitespace characters in front of every line (the numbers are right-aligned). When I read this file, it creates a column of NaN. Why does this happen, and what is the best way to prevent it?

我正在使用 pandas.read_csv 读取以空格分隔的文件。该文件的每一行前面都有可变数量的空白字符（数字右对齐）。当我阅读此文件时，它会创建一列 NaN。为什么会发生这种情况，预防它的最佳方法是什么？

Example:

例子：

Text file:

文本文件：

  9.0  3.3 4.0
 32.3 44.3 5.1
  7.2  1.1 0.9

Command:

命令：

import pandas as pd
pd.read_csv("test.txt",delim_whitespace=True,header=None)

Output:

输出：

    0     1     2    3
0 NaN   9.0   3.3  4.0
1 NaN  32.3  44.3  5.1
2 NaN   7.2   1.1  0.9

Answer 1

采纳答案by DSM

FWIW I tend to use \s+instead, and it doesn't suffer the same problem:

FWIW 我倾向于使用\s+，它不会遇到同样的问题：

>>> pd.read_csv("wspace.csv", header=None, delim_whitespace=True)
    0     1     2    3
0 NaN   9.0   3.3  4.0
1 NaN  32.3  44.3  5.1
2 NaN   7.2   1.1  0.9
>>> pd.read_csv("wspace.csv", header=None, sep=r"\s+")
      0     1    2
0   9.0   3.3  4.0
1  32.3  44.3  5.1
2   7.2   1.1  0.9

Python 使用 Pandas 读取带有前导空格的文本文件会给出 NaN 列

提问by Caleb

采纳答案by DSM

相关推荐

最近更新

标签

Python 使用 Pandas 读取带有前导空格的文本文件会给出 NaN 列

提问by Caleb

采纳答案by DSM

相关推荐

Python 如何使用 QFileDialog 选项并检索 saveFileName？

Python 在 matplotlib 中绘制不同的颜色

Python 输入和输出 numpy 数组到 h5py

如何在Python中读取输入文件？

相关推荐

最近更新

标签