Python 如何将熊猫数据框的索引转换为列？

Question

提问by msakya

This seems rather obvious, but I can't seem to figure out how to convert an index of data frame to a column?

这似乎很明显，但我似乎无法弄清楚如何将数据框的索引转换为列？

For example:

例如：

df=
        gi       ptt_loc
 0  384444683      593  
 1  384444684      594 
 2  384444686      596

To,

到，

df=
    index1    gi       ptt_loc
 0  0     384444683      593  
 1  1     384444684      594 
 2  2     384444686      596

Answer 1

采纳答案by behzad.nouri

either:

任何一个：

df['index1'] = df.index

or, .reset_index:

或者，.reset_index：

df.reset_index(level=0, inplace=True)

so, if you have a multi-index frame with 3 levels of index, like:

因此，如果您有一个具有 3 个索引级别的多索引框架，例如：

>>> df
                       val
tick       tag obs        
2016-02-26 C   2    0.0139
2016-02-27 A   2    0.5577
2016-02-28 C   6    0.0303

and you want to convert the 1st (tick) and 3rd (obs) levels in the index into columns, you would do:

并且您想将索引中的第 1 ( tick) 和第 3 ( obs) 级转换为列，您可以这样做：

>>> df.reset_index(level=['tick', 'obs'])
          tick  obs     val
tag                        
C   2016-02-26    2  0.0139
A   2016-02-27    2  0.5577
C   2016-02-28    6  0.0303

Answer 2

回答by Apogentus

For MultiIndex you can extract its subindex using

对于 MultiIndex，您可以使用提取其子索引

df['si_name'] = R.index.get_level_values('si_name')

where si_nameis the name of the subindex.

其中si_name是子索引的名称。

Answer 3

回答by Ted Petrou

To provide a bit more clarity, let's look at a DataFrame with two levels in its index (a MultiIndex).

为了更清晰，让我们看一下索引中有两个级别（MultiIndex）的 DataFrame。

index = pd.MultiIndex.from_product([['TX', 'FL', 'CA'], 
                                    ['North', 'South']], 
                                   names=['State', 'Direction'])

df = pd.DataFrame(index=index, 
                  data=np.random.randint(0, 10, (6,4)), 
                  columns=list('abcd'))

The reset_indexmethod, called with the default parameters, converts all index levels to columns and uses a simple RangeIndexas new index.

该reset_index方法使用默认参数调用，将所有索引级别转换为列并使用简单RangeIndex索引作为新索引。

df.reset_index()

Use the levelparameter to control which index levels are converted into columns. If possible, use the level name, which is more explicit. If there are no level names, you can refer to each level by its integer location, which begin at 0 from the outside. You can use a scalar value here or a list of all the indexes you would like to reset.

使用该level参数来控制将哪些索引级别转换为列。如果可能，请使用更明确的级别名称。如果没有级别名称，您可以通过其整数位置来引用每个级别，该位置从外部从 0 开始。您可以在此处使用标量值或要重置的所有索引的列表。

df.reset_index(level='State') # same as df.reset_index(level=0)

In the rare event that you want to preserve the index and turn the index into a column, you can do the following:

在极少数情况下，您希望保留索引并将索引转换为列，您可以执行以下操作：

# for a single level
df.assign(State=df.index.get_level_values('State'))

# for all levels
df.assign(**df.index.to_frame())

Answer 4

回答by bunji

If you want to use the reset_indexmethod and also preserve your existing index you should use:

如果要使用该reset_index方法并保留现有索引，则应使用：

df.reset_index().set_index('index', drop=False)

or to change it in place:

或将其更改到位：

df.reset_index(inplace=True)
df.set_index('index', drop=False, inplace=True)

For example:

例如：

print(df)
          gi  ptt_loc
0  384444683      593
4  384444684      594
9  384444686      596

print(df.reset_index())
   index         gi  ptt_loc
0      0  384444683      593
1      4  384444684      594
2      9  384444686      596

print(df.reset_index().set_index('index', drop=False))
       index         gi  ptt_loc
index
0          0  384444683      593
4          4  384444684      594
9          9  384444686      596

And if you want to get rid of the index label you can do:

如果你想摆脱索引标签，你可以这样做：

df2 = df.reset_index().set_index('index', drop=False)
df2.index.name = None
print(df2)
   index         gi  ptt_loc
0      0  384444683      593
4      4  384444684      594
9      9  384444686      596

Answer 5

回答by jpp

`rename_axis`+ `reset_index`

You can first rename your index to a desired label, thenelevate to a series:

您可以先将索引重命名为所需的标签，然后提升为系列：

df = df.rename_axis('index1').reset_index()

print(df)

   index1         gi  ptt_loc
0       0  384444683      593
1       1  384444684      594
2       2  384444686      596

This works also for MultiIndexdataframes:

这也适用于MultiIndex数据框：

print(df)
#                        val
# tick       tag obs        
# 2016-02-26 C   2    0.0139
# 2016-02-27 A   2    0.5577
# 2016-02-28 C   6    0.0303

df = df.rename_axis(['index1', 'index2', 'index3']).reset_index()

print(df)

       index1 index2  index3     val
0  2016-02-26      C       2  0.0139
1  2016-02-27      A       2  0.5577
2  2016-02-28      C       6  0.0303

Answer 6

回答by Avneesh Hota

df1 = pd.DataFrame({"gi":[232,66,34,43],"ptt":[342,56,662,123]})
p = df1.index.values
df1.insert( 0, column="new",value = p)
df1

    new     gi     ptt
0    0      232    342
1    1      66     56 
2    2      34     662
3    3      43     123

Answer 7

回答by maria_g

A very simple way of doing this is to use reset_index() method.For a data frame df use the code below:

一个非常简单的方法是使用 reset_index() 方法。对于数据框 df 使用以下代码：

df.reset_index(inplace=True)

This way, the index will become a column, and by using inplace as True,this become permanent change.

这样，索引将成为一列，并且通过使用 inplace 为 True，这将成为永久更改。

Python 如何将熊猫数据框的索引转换为列？

提问by msakya

采纳答案by behzad.nouri

回答by Apogentus

回答by Ted Petrou

回答by bunji

回答by jpp

`rename_axis`+ `reset_index`

`rename_axis`+ `reset_index`

回答by Avneesh Hota

回答by maria_g

相关推荐

最近更新

标签

Python 如何将熊猫数据框的索引转换为列？

提问by msakya

采纳答案by behzad.nouri

回答by Apogentus

回答by Ted Petrou

回答by bunji

回答by jpp

rename_axis+ reset_index

rename_axis+ reset_index

回答by Avneesh Hota

回答by maria_g

相关推荐

Python + 不支持的操作数类型：'int' 和 'str'

Python 将项目添加到 pandas.Series？

通用装饰器来包装尝试，除了在 python 中？

Python PIL NameError 全局名称 未定义图像

相关推荐

最近更新

标签

`rename_axis`+ `reset_index`

`rename_axis`+ `reset_index`

Python PIL NameError 全局名称未定义图像