我正在使用druid io 0.9.0。我正在尝试添加后聚合字段作为指标规格。我的意图是要显示聚合后字段的值,类似于显示指标(度量)的方式(在Druid io中使用Pivot)。
我的Druid io模式文件是
{
"dataSources" : {
"NPS1112" : {
"spec" : {
"dataSchema" : {
"dataSource" : "NPS1112",
"parser" : {
"type" : "string",
"parseSpec" : {
"timestampSpec" : {
"column" : "timestamp",
"format" : "auto"
},
"dimensionsSpec" : {
"dimensions" : ["dimension1","dimension2","dimension3"],
"dimensionExclusions" : [
"timestamp",
"OverallRating",
"DeliveryTimeRating",
"ItemQualityRating",
"isPromoter",
"isDetractor"
]
},
"format" : "json"
}
},
"granularitySpec" : {
"type" : "uniform",
"segmentGranularity" : "hour",
"queryGranularity" : "none"
},
"aggregations" : [
{ "type" : "count", "name" : "rows"},
{ "type" : "doubleSum", "name" : "CountOfPromoters", "fieldName" : "isPromoter" },
{ "type" : "doubleSum", "name" : "CountOfDetractor", "fieldName" : "isDetractor" }
],
"postAggregations" : [
{ "type" : "arithmetic",
"name" : "PromoterPercentage",
"fn" : "/",
"fields" : [
{ "type" : "fieldAccess", "name" : "CountOfPromoters", "fieldName" : "CountOfPromoters" },
{ "type" : "fieldAccess", "name" : "rows", "fieldName" : "rows" }
]
},
{ "type" : "arithmetic",
"name" : "DetractorPercentage",
"fn" : "/",
"fields" : [
{ "type" : "fieldAccess", "name" : "CountOfDetractor", "fieldName" : "CountOfDetractor" },
{ "type" : "fieldAccess", "name" : "rows", "fieldName" : "rows" }
]
},
{ "type" : "arithmetic",
"name" : "NPS",
"fn" : "-",
"fields" : [
{ "type" : "fieldAccess", "name" : "PromoterPercentage", "fieldName" : "PromoterPercentage" },
{ "type" : "fieldAccess", "name" : "DetractorPercentage", "fieldName" : "DetractorPercentage" }
]
}
],
"metricsSpec" : [
{
"type" : "count",
"name" : "CountOfResponses"
},
{
"type" : "fieldAccess",
"name" : "CountOfPromoters"
}
]
},
"ioConfig" : {
"type" : "realtime"
},
"tuningConfig" : {
"type" : "realtime",
"maxRowsInMemory" : "10000",
"intermediatePersistPeriod" : "PT10M",
"windowPeriod" : "PT10M"
}
},
"properties" : {
"task.partitions" : "1",
"task.replicants" : "1"
}
}
},
"properties" : {
"zookeeper.connect" : "localhost",
"druid.discovery.curator.path" : "/druid/discovery",
"druid.selectors.indexing.serviceName" : "druid/overlord",
"http.port" : "8200",
"http.threads" : "4"
}
}
我的代码,用于使用Java客户端发送字段。
final Map<String,Object> obj = new HashMap<String, Object>();
obj.put("timestamp", new DateTime().toString());
obj.put("OverallRating", (ran.nextInt(high-low) + low));
obj.put("DeliveryTimeRating", (ran.nextInt(high-low) + low));
obj.put("ItemQualityRating", (ran.nextInt(high-low) + low));
obj.put("isPromoter", ((ran.nextInt(high-low) + low)%2) == 0 ? 1 : 0);
obj.put("isDetractor", ((ran.nextInt(high-low) + low)%2) == 0 ? 1 : 0);
obj.put("dimension1", "dimension1-"+ (ran.nextInt(high-low) + low));
obj.put("dimension2", "dimension2-"+ (ran.nextInt(high-low) + low));
obj.put("dimension3", "dimension3-"+ (ran.nextInt(high-low) + low));
谁能指出我的错误。
最佳答案
我不知道您是否可以在摄入规格中做到这一点(我实际上想知道是否可以!),但是您可以在数据透视配置中添加后期汇总。据我了解,后期聚合实际上是druid查询的一部分。
首先,使用数据透视表生成配置文件:
pivot --druid your.druid.broker.host:8082 --print-config --with-comments > config.yaml
然后修改config.yaml。语法完全不同,但是您可以很容易地组合聚合器。这是config.yaml文件中提供的示例:
# This is the place where you might want to add derived measures (a.k.a Post Aggregators).
#
# Here are some examples of possible derived measures:
#
# - name: ecpm
# title: eCPM
# expression: $main.sum($revenue) / $main.sum($impressions) * 1000
#
# - name: usa_revenue
# title: USA Revenue
# expression: $main.filter($country == 'United States').sum($revenue)
最后,使用
--config
标志运行数据透视pivot --config config.yaml
希望能有所帮助! :)
关于java - 如何在Druid io中将Post Aggregation值字段添加为指标,我们在Stack Overflow上找到一个类似的问题:https://stackoverflow.com/questions/37655938/