pandas 如何在 Python Numpy 中使用 train_test_split 修复值错误

Question

提问by python_beginner

I am using sklearn with a numpy array. I have 2 arrays (x, y) and they should be:

我正在使用带有 numpy 数组的 sklearn。我有 2 个数组 (x, y)，它们应该是：

test_size=0.2
train_size=0.8

This is my current code:

这是我当前的代码：

def predict():

    sample_data = pd.read_csv("includes\csv.csv")

    x = np.array(sample_data["day"])
    y = np.array(sample_data["balance"])


    x = x.reshape(1, -1)



    y = y.reshape(1, -1)




    print(x)
    print(y)



    X_train, X_test, y_train, y_test = train_test_split(x, y, test_size=0.2)



    clf = LinearRegression()
    clf.fit(x_train, y_train)

    clf.score(x_test, y_test)

The error is:

错误是：

ValueError: With n_samples=1, test_size=0.2 and train_size=None, the resulting train set will be empty. Adjust any of the aforementioned parameters.

, and it appears in the line:

，它出现在以下行中：

X_train, X_test, y_train, y_test = train_test_split(x, y, test_size=0.2)

Any ideas why that appears?

任何想法为什么会出现？

Answer 1

回答by Alejandro Aldana

I had that problem. Check the library "scikit-learn". sklearn have problems with the version 0.20.0+ of scikt-learn, try to do:

我有那个问题。检查库“scikit-learn”。sklearn scikt-learn 0.20.0+版本有问题，尝试做：

Windows: pip uninstall scikit-learn
Linux: sudo python36 -m pip uninstall scikit-learn

视窗：pip uninstall scikit-learn
Linux：sudo python36 -m pip uninstall scikit-learn

and install:

并安装：

Windows: pip install scikit-learn==0.19.1
Linux: sudo python36 -m pip install scikit-learn==0.19.1

视窗：pip install scikit-learn==0.19.1
Linux：sudo python36 -m pip install scikit-learn==0.19.1

pandas 如何在 Python Numpy 中使用 train_test_split 修复值错误

提问by python_beginner

回答by Alejandro Aldana

相关推荐

最近更新

标签

pandas 如何在 Python Numpy 中使用 train_test_split 修复值错误

提问by python_beginner

回答by Alejandro Aldana

相关推荐

pandas.errors.ParserError: ',' 预期在 '"' 之后

pandas 熊猫重命名索引

使用 Pandas 将 CSV 读入具有不同行长的数据帧

将 Pandas Dataframe 转换为 numpy 数组

相关推荐

最近更新

标签