What is a better way to sort by a 5-star rating?

Question

I'm trying to sort a bunch of products by customer rating using a 5-star system. The site I'm setting this up for does not have many ratings and keeps adding new products, so there will usually be a few products with only a handful of ratings.

I tried using average star rating but that algorithm fails when there is a small number of ratings.

For example, a product with three 5-star ratings would rank above a product with one hundred 5-star ratings and two 2-star ratings.

Shouldn't the second product rank higher, since its larger number of ratings makes it statistically more trustworthy?

Recommended answer

Prior to 2015, the Internet Movie Database (IMDb) publicly listed the formula used to rank their Top 250 movies list. To quote:

weighted rating (WR) = (v ÷ (v+m)) × R + (m ÷ (v+m)) × C

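Where:

  • R = average for the movie (mean) = (Rating)
  • v = number of votes for the movie = (votes)
  • m = minimum votes required to be listed in the Top 250
  • C = the mean vote across the whole report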

For the Top 250, only votes from regular voters are considered.

It's not so hard to understand. The formula is:

rating = (v / (v + m)) * R +
         (m / (v + m)) * C;

Which can be mathematically simplified to:

rating = (R * v + C * m) / (v + m);
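The simplification is just a common denominator. As a worked step, assuming R is the plain mean of the item's v votes, write S for the sum of those votes (S is notation introduced here for illustration, not part of the original formula):

```latex
\begin{aligned}
\text{rating} &= \frac{v}{v+m}\,R + \frac{m}{v+m}\,C \\
              &= \frac{R\,v + C\,m}{v+m} \\
              &= \frac{S + C\,m}{v+m}, \qquad S = R\,v .
\end{aligned}
```

The last line reads as an ordinary average over v real votes plus m extra votes worth C each.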

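The variables are:

  • R: the item's own rating, i.e. the average of the votes it has received.
  • C: the average rating across all items.
  • v: the number of votes the item has received.
  • m: a tunable parameter, the number of imaginary votes added to each item.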

All the formula does is: add m imaginary votes, each with a value of C, before calculating the average. In the beginning, when there isn't enough data (i.e. the number of votes is dramatically less than m), this causes the blanks to be filled in with average data. However, as votes accumulate, the imaginary votes are eventually drowned out by real ones.
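As a minimal sketch, here is the formula applied to the two products from the question. The function name `weighted_rating` and the values C = 3.5 and m = 10 are assumptions chosen for illustration; a real site would derive C from its own data and tune m.

```python
def weighted_rating(R: float, v: int, C: float, m: float) -> float:
    """Simplified formula: rating = (R * v + C * m) / (v + m)."""
    return (R * v + C * m) / (v + m)


# Assumed, illustrative values: site-wide average of 3.5 stars,
# and 10 imaginary votes added to every product.
C, m = 3.5, 10

# Product with three 5-star ratings.
few_votes = weighted_rating(R=5.0, v=3, C=C, m=m)

# Product with 100 five-star and 2 two-star ratings: mean = 504 / 102.
many_votes = weighted_rating(R=504 / 102, v=102, C=C, m=m)

print(f"3 ratings:   {few_votes:.2f}")
print(f"102 ratings: {many_votes:.2f}")
```

With these assumed values the three-vote product scores roughly 3.85 and the 102-vote product roughly 4.81, so the product with more evidence ranks first, which is the ordering the question asks for.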

In this system, votes don't cause the rating to fluctuate wildly. Instead, they merely perturb it a bit in some direction.
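To make the damping concrete, the sketch below compares how far a single 1-star vote moves a plain average and the weighted rating for an item that so far has two 5-star votes. The helper names and the values C = 3.5 and m = 10 are hypothetical, picked only for illustration.

```python
def plain_average(votes):
    """Ordinary mean of the real votes (requires at least one vote)."""
    return sum(votes) / len(votes)


def damped_average(votes, C, m):
    """Mean of the real votes plus m imaginary votes worth C each."""
    return (sum(votes) + C * m) / (len(votes) + m)


C, m = 3.5, 10  # assumed values, not taken from the original answer

before = [5, 5]        # two 5-star votes
after = before + [1]   # one new 1-star vote arrives

plain_shift = plain_average(before) - plain_average(after)
damped_shift = damped_average(before, C, m) - damped_average(after, C, m)

print(f"plain average drops by  {plain_shift:.2f} stars")
print(f"damped average drops by {damped_shift:.2f} stars")
```

Under these assumptions the plain average falls by about 1.33 stars while the weighted rating falls by about 0.21, because the new vote is one voice among 13 rather than one among 3.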

When there are zero votes, only imaginary votes exist, and all of them are C. Thus, each item begins with a rating of C.
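A small sketch of the zero-vote case, working from the sum of votes so that an unrated item needs no special handling (as long as m is greater than zero). The catalogue, the function name, and the values of C and m are made up for illustration.

```python
def rating_from_votes(votes, C, m):
    """(sum of real votes + C * m) / (number of real votes + m); m must be > 0."""
    return (sum(votes) + C * m) / (len(votes) + m)


C, m = 3.5, 10  # assumed values

# Hypothetical catalogue: product name -> list of star votes.
catalogue = {
    "new": [],              # no votes yet
    "liked": [5, 5, 4],
    "disliked": [1, 2, 1],
}

scores = {name: rating_from_votes(votes, C, m) for name, votes in catalogue.items()}
ranking = sorted(scores, key=scores.get, reverse=True)

print(scores["new"])  # equals C, since only imaginary votes exist
print(ranking)
```

The unrated product starts at exactly C and therefore sorts between the product that has collected good votes and the one that has collected bad votes, rather than at the very top or bottom.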

See also:

  • A demo. Click "Solve".
  • Another explanation of IMDb's system.
  • An explanation of a similar Bayesian star-rating system.
