问题描述
对于WHERE子句中指定了分区键的二级索引查询,二级索引查询是命中所有集群节点,还是只命中指定分区键的节点?
如果后者是正确的,那么二级索引也很适合高基数字段(仅适用于满足分区键的查询).
For secondary index queries that the partition key is specified in the WHERE clause, does the secondary index lookup hits all cluster nodes, or just the node of the specified partition key?
If the latter is correct, then secondary index will be a good fit also for high cardinality fields (only for queries that satisfies the partition key).
例如,对于以下提要架构,查询特定提要(指定提要 ID)以检索现有或删除的提要项目应该非常有效:
For example, for the following feed schema, query of a specific feed (feed_id specified) to retrieve existing or deleted feed items should be very efficient:
CREATE TABLE my_feed (
feed_id int,
item_id timeuuid,
is_deleted boolean,
data text,
PRIMARY KEY (feed_id, item_id)
) WITH CLUSTERING ORDER BY (item_id DESC);
CREATE INDEX my_feed_is_deleted_idx ON my_feed (is_deleted);
==> SELECT * FROM my_feed WHERE feed_id=1 AND is_deleted=false; --efficient?
推荐答案
如果你先点击了一个分区键,那么它不会是一个集群范围的操作.只会命中目标分区.如果分区中有很多行的宽行,二级索引将是一种在分区被命中后过滤它们的有效方法.
If you hit a partition key first, then it won't be a cluster wide operation. Only the target partition will be hit. If you have wide rows with many rows in a partition, a secondary index will be an efficient way to filter them down once a partition is hit.
这篇关于如果指定了分区键,Cassandra 二级索引是否被优化?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!