python - 如何将 sklearn 的 TransformedTargetRegressor 与自定义数据转换器一起使用?
问题描述
我想使用这个答案sklearn.compose.TransformedTargetRegressor
中的说明。但是,变压器是定制的,我遇到了错误。
在这个最小的例子中,目标值应该乘以 10,然后在预测时再除以 10。(在我的实际应用中,目标值必须从非数字格式转换为数字格式。)
import numpy as np
import sklearn
from sklearn.compose import TransformedTargetRegressor
from sklearn.linear_model import LinearRegression
class MyTransform(sklearn.base.TransformerMixin):
def fit(self, *_, **__):
return self
def transform(self, X):
return np.array(X)*10
def inverse_transform(self, X):
return np.array(X)/10
def MyLinearRegression():
return TransformedTargetRegressor(
regressor=LinearRegression(),
transformer=MyTransform()
)
if __name__ == '__main__':
model = MyLinearRegression()
model.fit(X=[[1], [2], [3]], y=[1, 2, 3]) # raises TypeError
这提出了:
Traceback (most recent call last):
File "<input>", line 1, in <module>
File "C:\Program Files\JetBrains\PyCharm Community Edition 2019.3.3\plugins\python-ce\helpers\pydev\_pydev_bundle\pydev_umd.py", line 197, in runfile
pydev_imports.execfile(filename, global_vars, local_vars) # execute the script
File "C:\Program Files\JetBrains\PyCharm Community Edition 2019.3.3\plugins\python-ce\helpers\pydev\_pydev_imps\_pydev_execfile.py", line 18, in execfile
exec(compile(contents+"\n", file, 'exec'), glob, loc)
File "C:/Users/me/.PyCharmCE2019.3/config/scratches/scratch.py", line 26, in <module>
model.fit(X=[[1], [2], [3]], y=[1, 2, 3]) # raises TypeError
File "C:\Users\me\.virtualenvs\project--3333Ox_\lib\site-packages\sklearn\compose\_target.py", line 185, in fit
self._fit_transformer(y_2d)
File "C:\Users\me\.virtualenvs\project--3333Ox_\lib\site-packages\sklearn\compose\_target.py", line 127, in _fit_transformer
self.transformer_ = clone(self.transformer)
File "C:\Users\me\.virtualenvs\project--3333Ox_\lib\site-packages\sklearn\base.py", line 64, in clone
raise TypeError("Cannot clone object '%s' (type %s): "
TypeError: Cannot clone object '<__main__.MyTransform object at 0x000001653B2FA9A0>' (type <class '__main__.MyTransform'>): it does not seem to be a scikit-learn estimator as it does not implement a 'get_params' methods.
解决方案
您只需要继承 sklearn.base.BaseEstimator 以及 transformermixin :)。类型错误说:
它似乎不是 scikit-learn 估计器
所以你只需要让它成为一个:D。下面的代码应该可以工作。
import numpy as np
import sklearn
from sklearn.compose import TransformedTargetRegressor
from sklearn.linear_model import LinearRegression
class MyTransform(sklearn.base.BaseEstimator, sklearn.base.TransformerMixin):
def fit(self, *_, **__):
return self
def transform(self, X):
return np.array(X)*10
def inverse_transform(self, X):
return np.array(X)/10
def MyLinearRegression():
return TransformedTargetRegressor(
regressor=LinearRegression(),
transformer=MyTransform()
)
model = MyLinearRegression()
model.fit(X=[[1], [2], [3]], y=[1, 2, 3])
推荐阅读
- deep-learning - 无法修复 SystemExit:协作中的 2 错误
- c - C - 使用自定义文件名创建新文件
- qt - 为什么 Qt 嵌入式应用程序显示黑屏?
- openhtmltopdf - openhtmltopdf.pdfboxout 不支持相关属性 HTML TO PDF - Java
- python - 如何避免高 DPI 的 Windows 10 自动缩放?
- python - 我想在 django 中将 pdf 与 httpresponse 合并
- javascript - 如何在不提及 javascript 中的名称的情况下访问对象内部命名对象内部的重复元素数据?
- python - 遍历一个元组以形成一个字符串
- revit-api - 无法更新地形材质
- python-3.x - 遍历 SRT 文件直到找到索引