pandas 将 OLS 回归摘要打印到文本文件
声明:本页面是StackOverFlow热门问题的中英对照翻译,遵循CC BY-SA 4.0协议,如果您需要使用它,必须同样遵循CC BY-SA许可,注明原文地址和作者信息,同时你必须将它归于原作者(不是我):StackOverFlow
原文地址: http://stackoverflow.com/questions/36360932/
Warning: these are provided under cc-by-sa 4.0 license. You are free to use/share it, But you must attribute it to the original authors (not me):
StackOverFlow
Print OLS regression summary to text file
提问by Stefano Potter
I am running OLS regression using pandas.stats.api.ols
using a groupby
with the following code:
我正在pandas.stats.api.ols
使用groupby
带有以下代码的a运行 OLS 回归:
from pandas.stats.api import ols
df=pd.read_csv(r'F:\file.csv')
result=df.groupby(['FID']).apply(lambda d: ols(y=d.loc[:, 'MEAN'], x=d.loc[:, ['Accum_Prcp', 'Accum_HDD']]))
for i in result:
x=pd.DataFrame({'FID':i.index, 'delete':i.values})
frame = pd.concat([x,DataFrame(x['delete'].tolist())], axis=1, join='outer')
del frame['delete']
print frame
but this returns the error:
但这会返回错误:
AttributeError: 'OLS' object has no attribute 'index'
I have about 2,000 items in my group by and when I print each one out they look something like this:
我的小组中有大约 2,000 件物品,当我打印出每件物品时,它们看起来像这样:
-
——
------------------------Summary of Regression Analysis-------------------------
Formula: Y ~ <Accum_Prcp> + <Accum_HDD> + <intercept>
Number of Observations: 79
Number of Degrees of Freedom: 3
R-squared: 0.1242
Adj R-squared: 0.1012
Rmse: 0.1929
F-stat (2, 76): 5.3890, p-value: 0.0065
Degrees of Freedom: model 2, resid 76
-----------------------Summary of Estimated Coefficients------------------------
Variable Coef Std Err t-stat p-value CI 2.5% CI 97.5%
--------------------------------------------------------------------------------
Accum_Prcp 0.0009 0.0003 3.28 0.0016 0.0004 0.0015
Accum_HDD 0.0000 0.0000 1.98 0.0516 0.0000 0.0000
intercept 0.4750 0.0811 5.86 0.0000 0.3161 0.6340
---------------------------------End of Summary---------------------------------
I want to be able to export each one to a csv so that I can view them individually.
我希望能够将每个文件导出到 csv,以便我可以单独查看它们。
采纳答案by Stefan
In order to write out the result
of pandas.stats.api.ols
, use a text file to match the output format, for instance:
为了写出result
of pandas.stats.api.ols
,请使用文本文件来匹配输出格式,例如:
from pandas.stats.api import ols
grps = df.groupby(['FID'])
for fid, grp in grps:
result = ols(y=grp.loc[:, 'MEAN'], x=grp.loc[:, ['Accum_Prcp', 'Accum_HDD']])
text_file = open("Output {}.txt".format(fid), "w")
text_file.write(result.summary)
text_file.close()
回答by Anton Tarasenko
As of statsmodels 0.9
, the Summary
class supports exportto multiple formats, including CSV and text:
截至statsmodels 0.9
,Summary
该类支持导出为多种格式,包括 CSV 和文本:
import numpy as np
import statsmodels.api as sm
import statsmodels.formula.api as smf
dat = sm.datasets.get_rdataset("Guerry", "HistData").data
results = smf.ols('Lottery ~ Literacy + np.log(Pop1831)', data=dat).fit()
with open('summary.txt', 'w') as fh:
fh.write(results.summary().as_text())
with open('summary.csv', 'w') as fh:
fh.write(results.summary().as_csv())
The output of as_csv()
is not machine-readable. Dumping results
parameters with repr()
would be.
的输出as_csv()
不是机器可读的。转储results
参数repr()
将是。