XML to Pandas DataFrame

Question

Asked by root

I have an XML file with thousands of lines like:

<Word x1="206" y1="120" x2="214" y2="144" font="Times-Roman" style="font-size:22pt">WORD</Word>

I want to convert it (with all its attributes) to a pandas DataFrame. To do that, I could loop through the file using Beautiful Soup and insert the values row by row, or build lists to be inserted as columns. However, I would like to know if there's a more Pythonic way of accomplishing this. Thank you in advance.

Code example:

from pandas import DataFrame

# soup is the BeautifulSoup object for the parsed file
x1list = []
x2list = []

for word in soup.page.findAll('word'):
    x1list.append(int(word['x1']))
    x2list.append(int(word['x2']))
df = DataFrame({'x1': x1list, 'x2': x2list})

Answer 1

Accepted answer by eumiro

Try this:

DataFrame.from_records([(int(word['x1']), int(word['x2']))
                        for word in soup.page.findAll('word')],
                       columns=('x1', 'x2'))
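
Since the question asks for all of the attributes rather than just x1 and x2, the same idea can be extended by building one record per element from its attribute dictionary. The following is only a minimal sketch under a few assumptions: it swaps Beautiful Soup for the standard library's xml.etree.ElementTree so that it is self-contained, the sample data and its page wrapper element are made up from the line shown in the question, and the function name words_to_frame and the extra text column are hypothetical.

```python
import xml.etree.ElementTree as ET

import pandas as pd

# Hypothetical sample data; the <page> wrapper is assumed from soup.page in the question.
SAMPLE_XML = """<page>
<Word x1="206" y1="120" x2="214" y2="144" font="Times-Roman" style="font-size:22pt">WORD</Word>
<Word x1="220" y1="120" x2="260" y2="144" font="Times-Roman" style="font-size:22pt">NEXT</Word>
</page>"""


def words_to_frame(xml_text):
    """Build a DataFrame with one row per <Word> and one column per attribute."""
    root = ET.fromstring(xml_text)
    # elem.attrib maps attribute names to string values; the element text
    # is stored in an extra "text" column (an illustrative choice).
    records = [dict(elem.attrib, text=elem.text) for elem in root.iter("Word")]
    df = pd.DataFrame.from_records(records)
    # Attribute values arrive as strings, so convert the coordinates to integers.
    coords = ["x1", "y1", "x2", "y2"]
    df[coords] = df[coords].astype(int)
    return df


df = words_to_frame(SAMPLE_XML)
print(df)
```

Unlike Beautiful Soup's HTML parsers, ElementTree keeps tag names case-sensitive, which is why the sketch searches for "Word" rather than "word". For a file too large to hold in memory, ET.iterparse could replace ET.fromstring, at the cost of slightly more bookkeeping.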
