pandas 熊猫读取带有部分通配符的csv文件

Question

提问by Kvothe

I'm trying to write a script that imports a file, then does something with the file and outputs the result into another file.

我正在尝试编写一个导入文件的脚本，然后对该文件执行某些操作并将结果输出到另一个文件中。

df = pd.read_csv('somefile2018.csv')

The above code works perfectly fine. However, I'd like to avoid hardcoding the file name in the code.

上面的代码工作得很好。但是，我想避免在代码中硬编码文件名。

The script will be run in a folder (directory) that contains the script.pyand several csv files.

该脚本将在包含script.py和多个 csv 文件的文件夹（目录）中运行。

I've tried the following:

我尝试了以下方法：

somefile_path = glob.glob('somefile*.csv')

df = pd.read_csv(somefile_path)

But I get the following error:

但我收到以下错误：

ValueError: Invalid file path or buffer object type: <class 'list'>

Answer 1

回答by James

globreturns a list, not a string. The read_csvfunction takes a string as the input to find the file. Try this:

glob返回一个列表，而不是一个字符串。该read_csv函数将一个字符串作为输入来查找文件。尝试这个：

for f in glob('somefile*.csv'):
    df = pd.read_csv(f)
    ...
    # the rest of your script

Answer 2

回答by iDrwish

You can get the list of the CSV files in the script and loop over them.

您可以在脚本中获取 CSV 文件的列表并循环遍历它们。

from os import listdir
from os.path import isfile, join
mypath = os.getcwd()

csvfiles = [f for f in listdir(mypath) if isfile(join(mypath, f)) if '.csv' in f]

for f in csvfiles:
    pd.read_csv(f)
# the rest of your script

Answer 3

回答by pleicht17

To read all of the files that follow a certain pattern, so long as they share the same schema, use this function:

要读取遵循特定模式的所有文件，只要它们共享相同的架构，请使用此函数：

import glob
import pandas as pd

def pd_read_pattern(pattern):
    files = glob.glob(pattern)

    df = pd.DataFrame()
    for f in files:
        df = df.append(pd.read_csv(f))

    return df.reset_index(drop=True)

df = pd_read_pattern('somefile*.csv')

This will work with either an absolute or relative path.

这将适用于绝对或相对路径。

Answer 4

回答by Boud

Loop over each file and build a list of DataFrame, then assemble them together using concat.

循环遍历每个文件并构建一个 DataFrame 列表，然后使用concat.

pandas 熊猫读取带有部分通配符的csv文件

提问by Kvothe

回答by James

回答by iDrwish

回答by pleicht17

回答by Boud

相关推荐

最近更新

标签

pandas 熊猫读取带有部分通配符的csv文件

提问by Kvothe

回答by James

回答by iDrwish

回答by pleicht17

回答by Boud

相关推荐

pandas 熊猫箱线图作为具有单独 y 轴的子图

pandas 熊猫“时间戳”对象不可下标

pandas 熊猫 dropna() 功能不起作用

python - Pandas - Dataframe.set_index - 如何保留旧的索引列

相关推荐

最近更新

标签