本文介绍了如何获取XGBClassifier的预测p值?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我想知道XGBClassifier对它所做的每个预测有多自信.是否有可能具有这样的价值?还是predict_proba已经间接地建立了模型的可信度?

I'd like to know how confident an XGBClassifier is for each prediction it makes. Is it possible to have such a value? Or is the predict_proba already indirectly the confidence of the model?

推荐答案

您的直觉确实是正确的:predict_proba返回每个示例属于给定类的概率;来自 docs :

Your intuition is indeed correct: predict_proba returns the probability of each example being of a given class; from the docs:

预测每个数据示例属于给定类的概率.

Predict the probability of each data example being of a given class.

实际上,该概率反过来通常在惯例中被解释为预测的置信度.

This probability in turn is routinely interpreted in practice as the confidence of the prediction.

也就是说,这是一种特殊的,实用的解释,与p值或任何其他统计严谨度无关.一般来说,对于AFAIK而言,没有(或类似的)机器学习技术可采用的此类措施.

That said, this is an ad-hoc, practical interpretation, and it has nothing to do with p-values or any other measure of statistical rigour; generally speaking and AFAIK, there are no such measures available for this (and similar) machine learning techniques.

更笼统地说,您可能想知道p值本身已迅速在统计学家中失宠;一些快速链接:

On a more general level, you may be interested to know that p-values themselves have been quickly falling out of grace among statisticians; some quick links:

统计学家问题警告错误使用 P (自然)

p值的问题不仅仅是带有p值(Andrew Gelman @ American Statistician)

The problems with p-values are not just with p-values (Andrew Gelman @ American Statistician)

p值存在问题(指向数据科学博客文章)

The problem with p-values (Towards Data Science blog post)

这篇关于如何获取XGBClassifier的预测p值?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!

10-24 16:03