PySpark 中是否有与 scikit-learn 的 sample_weight 等效的参数?

本文介绍了PySpark 中是否有与 scikit-learn 的 sample_weight 等效的参数?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我目前正在使用 scikit-learn 库提供的 SGDClassifier.当我使用 fit 方法时，我可以设置 sample_weight 参数:

I am currently using the SGDClassifier provided by the scikit-learn library. When I use the fit method I can set the sample_weight parameter:

应用于单个样本的权重.如果没有提供，统一假设权重.这些权重将乘以class_weight(通过构造函数传递)如果 class_weight 是指定

我想切换到 PySpark 并使用 LogisticRegression 类.无论如何，我找不到类似于 sample_weight 的参数.有一个 weightCol 参数，但我认为它做了一些不同的事情.

I want to switch to PySpark and to use the LogisticRegression class. Anyway I cannot find a parameter similar to sample_weight. There is a weightCol parameter but I think it does something different.

你有什么建议吗?

中是否有与

PySpark 中是否有与 scikit-learn 的 sample_weight 等效的参数?

问题描述

推荐答案