Python 如何从 namedtuple 实例列表创建 Pandas DataFrame（带有索引或多索引）？

Question

提问by MikeRand

Simple example:

简单的例子：

>>> from collections import namedtuple
>>> import pandas

>>> Price = namedtuple('Price', 'ticker date price')
>>> a = Price('GE', '2010-01-01', 30.00)
>>> b = Price('GE', '2010-01-02', 31.00)
>>> l = [a, b]
>>> df = pandas.DataFrame.from_records(l, index='ticker')
Traceback (most recent call last)
...
KeyError: 'ticker'

Harder example:

更难的例子：

>>> df2 = pandas.DataFrame.from_records(l, index=['ticker', 'date'])
>>> df2

         0           1   2
ticker  GE  2010-01-01  30
date    GE  2010-01-02  31

Now it thinks that ['ticker', 'date']is the index itself, rather than the columns I want to use as the index.

现在它认为这['ticker', 'date']是索引本身，而不是我想用作索引的列。

Is there a way to do this without resorting to an intermediate numpy ndarray or using set_indexafter the fact?

有没有办法在不诉诸中间 numpy ndarray 或set_index事后使用的情况下做到这一点？

Answer 1

采纳答案by Andy Hayden

To get a Series from a namedtuple you could use the _fieldsattribute:

要从命名元组中获取系列，您可以使用该_fields属性：

In [11]: pd.Series(a, a._fields)
Out[11]:
ticker            GE
date      2010-01-01
price             30
dtype: object

Similarly you can create a DataFrame like this:

同样，您可以像这样创建一个 DataFrame：

In [12]: df = pd.DataFrame(l, columns=l[0]._fields)

In [13]: df
Out[13]:
  ticker        date  price
0     GE  2010-01-01     30
1     GE  2010-01-02     31

You have to set_indexafter the fact, but you can do this inplace:

你必须set_index事后，但你可以这样做inplace：

In [14]: df.set_index(['ticker', 'date'], inplace=True)

In [15]: df
Out[15]:
                   price
ticker date
GE     2010-01-01     30
       2010-01-02     31

Python 如何从 namedtuple 实例列表创建 Pandas DataFrame（带有索引或多索引）？

提问by MikeRand

采纳答案by Andy Hayden

相关推荐

最近更新

标签

Python 如何从 namedtuple 实例列表创建 Pandas DataFrame（带有索引或多索引）？

提问by MikeRand

采纳答案by Andy Hayden

相关推荐

Python ValueError: max() arg 是一个空序列

Python 导入错误：没有名为 PyQt4 的模块

Python 错误 - int 对象没有属性

Python - 从日期时间字符串中删除时间

相关推荐

最近更新

标签