pandas 未知标签类型 sklearn

Question

提问by Руслан Вергунов

I 'm new in sklearn. I 'm trying to do this code

我是 sklearn 的新手。我正在尝试执行此代码

data = pandas.read_csv('titanic.csv')
data= data[data['Pclass'].notnull() & data['Sex'].notnull() &         data['Age'].notnull() & data['Fare'].notnull()]   
test = data.loc[:,['Pclass','Sex','Age','Fare']]
target = data.loc[:,['Survived']]
test = test.replace(to_replace=['male','female'],value=[1,0])
clf=DecisionTreeClassifier(random_state=241)
clf.fit(target,test)

And I saw this error

我看到了这个错误

ValueError: Unknown label type: array([[ 22.    ,   3.    ,   7.25  ,        1.    ],
   [ 38.    ,   1.    ,  71.2833,   0.    ],
   [ 26.    ,   3.    ,   7.925 ,   0.    ],
   ..., 
   [ 19.    ,   1.    ,  30.    ,   0.    ],
   [ 26.    ,   1.    ,  30.    ,   1.    ],
   [ 32.    ,   3.    ,   7.75  ,   1.    ]])

ValueError: Unknown label type: array([[ 22.    ,   3.    ,   7.25  ,        1.    ],
   [ 38.    ,   1.    ,  71.2833,   0.    ],
   [ 26.    ,   3.    ,   7.925 ,   0.    ],
   ..., 
   [ 19.    ,   1.    ,  30.    ,   0.    ],
   [ 26.    ,   1.    ,  30.    ,   1.    ],
   [ 32.    ,   3.    ,   7.75  ,   1.    ]])

What is a problem?

什么是问题？

Answer 1

采纳答案by Nickil Maveli

You are currently providing a dataframe and not it's numpy array representation as the training input to the fitmethod. Do this instead:

您当前提供的是一个数据帧，而不是它的 numpy 数组表示作为该fit方法的训练输入。改为这样做：

clf.fit(X=test.values, y=target.values)   
# Even .asmatrix() works but is not generally recommended

pandas 未知标签类型 sklearn

提问by Руслан Вергунов

采纳答案by Nickil Maveli

相关推荐

最近更新

标签

pandas 未知标签类型 sklearn

提问by Руслан Вергунов

采纳答案by Nickil Maveli

相关推荐

pandas DataFrame：如何使用自定义方式剪切数据框？

pandas 处理标签编码的未知值

“未指定驱动程序名称”将 Pandas 数据框写入 SQL Server 表

df.where( ) 和 df [ (df [ ] == ) ] 在 Pandas 中的区别，python

相关推荐

最近更新

标签