pandas SKLearn MinMaxScaler - 仅缩放特定列

Question

提问by lte__

I'd like to scale some (but not all) of the columns in a Pandas dataFrame using a MinMaxScaler. How can I do it?

我想使用 MinMaxScaler 缩放 Pandas 数据帧中的一些（但不是全部）列。我该怎么做？

Answer 1

采纳答案by Random

Since sklearn >= 0.20 you can do it using Column Transformer

由于 sklearn >= 0.20 你可以使用Column Transformer

standard_transformer = Pipeline(steps=[
        ('standard', StandardScaler())])

minmax_transformer = Pipeline(steps=[
        ('minmax', MinMaxScaler())])


preprocessor = ColumnTransformer(
        remainder='passthrough', #passthough features not listed
        transformers=[
            ('std', standard_transformer , ['z']),
            ('mm', minmax_transformer , ['x','y'])
        ])

Answer 2

回答by MaxU

Demo:

演示：

In [90]: df = pd.DataFrame(np.random.randn(5, 3), index=list('abcde'), columns=list('xyz'))

In [91]: df
Out[91]:
          x         y         z
a -0.325882 -0.299432 -0.182373
b -0.833546 -0.472082  1.158938
c -0.328513 -0.664035  0.789414
d -0.031630 -1.040802 -1.553518
e  0.813328  0.076450  0.022122

In [92]: from sklearn.preprocessing import MinMaxScaler

In [93]: mms = MinMaxScaler()

In [94]: df[['x','z']] = mms.fit_transform(df[['x','z']])

In [95]: df
Out[95]:
          x         y         z
a  0.308259 -0.299432  0.505500
b  0.000000 -0.472082  1.000000
c  0.306662 -0.664035  0.863768
d  0.486932 -1.040802  0.000000
e  1.000000  0.076450  0.580891

the same result can be also achieved using sklearn.preprocessing.minmax_scale:

同样的结果也可以使用sklearn.preprocessing.minmax_scale：

from sklearn.preprocessing import minmax_scale

df[['x','z']] = minmax_scale(df[['x','z']])

pandas SKLearn MinMaxScaler - 仅缩放特定列

提问by lte__

采纳答案by Random

回答by MaxU

相关推荐

最近更新

标签

pandas SKLearn MinMaxScaler - 仅缩放特定列

提问by lte__

采纳答案by Random

回答by MaxU

相关推荐

pandas 熊猫数据阅读器

pandas Panda Python - 将一列除以 100（然后四舍五入 2.dp）

pandas 熊猫将列转换为日期时间

将对象转换为 Int Pandas

相关推荐

最近更新

标签