algorithm - 蛋糕比较算法

这实际上是关于比较蛋糕。我的 friend 正在举办一个纸杯蛋糕派对，目的是确定曼哈顿最好的纸杯蛋糕店。实际上，它的野心远不止于此。继续阅读。

有 27 家面包店和 19 人参加(可能有一两个人没有出现)。每个面包店将有 4 个纸杯蛋糕，如果可能的话，包括主食—— Vanilla 、巧克力和红色天鹅绒——并用通配符口味将这 4 个蛋糕四舍五入。有 4 个属性可以用来评价纸杯蛋糕:味道、湿度、外观(漂亮)和一般的好坏。人们将为他们采样的每个纸杯蛋糕的每个属性提供 5 分制的评分。最后，每个纸杯蛋糕可以切成 4 或 5 块。

问题是:为每个属性和每种口味(将“通配符”视为一种口味)对面包店进行统计上有意义的排名的程序是什么？具体来说，我们希望对面包店进行 8 次排名:对于每种口味，我们希望按好坏对面包店进行排名(好坏是属性之一)，对于每个属性，我们希望对所有口味的面包店进行排名(即，独立于 flavor ) ，即聚合所有口味)。特等奖授予在善良属性上排名靠前的面包店。

当然，概括这一点的加分点。

这将在大约 12 小时内发生，因此如果在此期间没有人回答，我将发布作为答案的答案。

PS:这是关于它的派对后博客文章:http://gracenotesnyc.com/2009/08/05/gracenotes-nycs-cupcake-cagematch-the-sweetest-battle-ever/

最佳答案

这就是我们最终要做的。我做了一个巨大的表格来收集每个人在 http://etherpad.com/sugarorgy 上的评分(修订版 25，以防万一它被我添加了这个公共(public)链接而遭到破坏)，然后使用以下 Perl 脚本将数据解析为 CSV 文件:

#!/usr/bin/env perl
# Grabs the cupcake data from etherpad and parses it into a CSV file.

use LWP::Simple qw(get);

$content = get("http://etherpad.com/ep/pad/export/sugarorgy/latest?format=txt");
$content =~ s/^.*BEGIN_MAGIC\s*//s;
$content =~ s/END_MAGIC.*$//s;
$bakery = "none";
for $line (split('\n', $content)) {
  next if $line =~ /sar kri and deb/;
  if ($line =~ s/bakery\s+(\w+)//) { $bakery = $1; }
  $line =~ s/\([^\)]*\)//g; # strip out stuff in parens.
  $line =~ s/^\s+(\w)(\w)/$1 $2/;
  $line =~ s/\-/\-1/g;
  $line =~ s/^\s+//;
  $line =~ s/\s+$//;
  $line =~ s/\s+/\,/g;
  print "$bakery,$line\n";
}

然后我在 Mathematica 中做了平均和诸如此类的事情:

data = Import["!~/svn/sugar.pl", "CSV"];

(* return a bakery's list of ratings for the given type of cupcake *)
tratings[bak_, t_] := Select[Drop[First@Select[data,
                        #[[1]]==bak && #[[2]]==t && #[[3]]=="g" &], 3], #!=-1&]

(* return a bakery's list of ratings for the given cupcake attribute *)
aratings[bak_, a_] := Select[Flatten[Drop[#,3]& /@
                        Select[data, #[[1]]==bak && #[[3]]==a&]], #!=-1&]

(* overall rating for a bakery *)
oratings[bak_] := Join @@ (tratings[bak, #] & /@ {"V", "C", "R", "W"})

bakeries = Union@data[[All, 1]]

SortBy[{#, oratings@#, Round[Mean@oratings[#], .01]}& /@ bakeries, -#[[3]]&]

结果在 http://etherpad.com/sugarorgy 的底部。