pandas: skipping a range of rows after the header with pandas.read_excel

Question

Asked by florence-y

I know the argument usecols in pandas.read_excel() allows you to select specific columns.

Say I read an Excel file with pandas.read_excel(). My spreadsheet has 1161 rows. I want to keep the 1st row (index 0) and skip rows 2 through 337. It seems the skiprows argument only works when 0-based indexing is involved. I could be wrong, but every run of my code reads all 1161 rows rather than only the rows after the 337th. For example:

documentationscore_dataframe = pd.read_excel("Documentation Score Card_17DEC2015 Rev 2 17JAN2017.xlsx",
                                        sheet_name = "Sheet1",
                                        skiprows = "336",
                                        usecols = "H:BD")

Here is another attempt:

documentationscore_dataframe = pd.read_excel("Documentation Score Card_17DEC2015 Rev 2 17JAN2017.xlsx",
                                        sheet_name = "Sheet1",
                                        skiprows = "1:336",
                                        usecols = "H:BD")

I would like the dataframe to exclude rows 2 through 337 in the original Excel import.

Answer 1

Answered by jpp

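As per the documentation for pandas.read_excel, skiprows must be list-like to skip specific rows by their 0-indexed position. A string such as "336" or "1:336" is not interpreted as a row range.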

Try this instead to exclude rows 1 to 336 inclusive:

import pandas as pd

# Skip 0-indexed rows 1..336; row 0 is kept and becomes the header.
df = pd.read_excel("file.xlsx",
                   sheet_name="Sheet1",
                   skiprows=range(1, 337),
                   usecols="H:BD")

Note: a range object is considered list-like for this purpose, so no explicit list conversion is necessary.
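
To see why the list-like form matters, here is a minimal sketch that compares a list-like skiprows with an integer one. It uses pd.read_csv on a small in-memory CSV as a stand-in so it runs without an Excel file; the data and variable names are made up for illustration, and it assumes read_excel treats skiprows the same way, as its documentation describes.

```python
import io

import pandas as pd

# Hypothetical data standing in for the spreadsheet: line 0 is the header.
csv_text = "name,score\na,1\nb,2\nc,3\nd,4\ne,5\n"

# List-like: skip 0-indexed lines 1..3, so line 0 is still used as the header.
kept_header = pd.read_csv(io.StringIO(csv_text), skiprows=range(1, 4))

# Integer: skip the first 3 lines outright, so line 3 ("c,3") becomes the header.
lost_header = pd.read_csv(io.StringIO(csv_text), skiprows=3)

print(list(kept_header.columns))
print(list(lost_header.columns))
```

With the list-like form the original header survives and only the rows after the skipped span are read, which is the behaviour the question asks for.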

Answer 2

Answered by Abdul-Razak Adam

Try this:

import pandas as pd

# 0-indexed rows to skip: 1..336, i.e. Excel rows 2 through 337.
rows_to_skip = list(range(1, 337))
documentationscore_dataframe = pd.read_excel("Documentation Score Card_17DEC2015 Rev 2 17JAN2017.xlsx",
                                    sheet_name="Sheet1",
                                    skiprows=rows_to_skip,
                                    usecols="H:BD")
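
The off-by-one between Excel's 1-based row numbers and the 0-based positions that skiprows expects is easy to get wrong. The helper below is a hypothetical convenience function (not part of pandas) that does the conversion for an inclusive span of Excel rows.

```python
def excel_rows_to_skiprows(first_row: int, last_row: int) -> range:
    """Convert an inclusive, 1-based Excel row span into 0-based skiprows positions."""
    if first_row < 1 or last_row < first_row:
        raise ValueError("expected 1 <= first_row <= last_row")
    # Excel row n sits at 0-based position n - 1; range() excludes its end,
    # so the end stays at last_row.
    return range(first_row - 1, last_row)


# Excel rows 2 through 337 map to 0-based positions 1 through 336.
rows_to_skip = excel_rows_to_skiprows(2, 337)
print(rows_to_skip)
```

The result can be passed directly as skiprows=rows_to_skip, since a range is list-like.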
