Question
I have a use case in which we have a few tables in BigQuery. I want to implement an index on one of the columns in a BigQuery table, but I can't find enough documentation to do that. I found a few blogs and posts saying that BigQuery doesn't support indexes. Please help me find a blog or post that explains how to implement an index on BigQuery. Thanks in advance.
Recommended answer
2019 update: Check out how clustering your tables improves query times and reduces the data scanned:
- https://medium.com/google-cloud/bigquery-optimized-cluster-your-tables-65e2f684594b
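To make the clustering suggestion concrete, here is a minimal sketch that builds the kind of DDL statement you could use to copy an existing table into a clustered one. The table names (`my_dataset.events`, `my_dataset.events_clustered`), the column `customer_id`, and the helper `clustered_copy_ddl` are hypothetical and not from the question; the code only builds the SQL string and does not contact BigQuery.

```python
from typing import Sequence


def clustered_copy_ddl(source_table: str, target_table: str,
                       cluster_columns: Sequence[str]) -> str:
    """Build a CREATE TABLE ... CLUSTER BY ... AS SELECT statement.

    Hypothetical helper for illustration only; it returns a SQL string
    and never talks to BigQuery itself.
    """
    # BigQuery accepts at most four clustering columns per table.
    if not 1 <= len(cluster_columns) <= 4:
        raise ValueError("expected between 1 and 4 clustering columns")
    columns = ", ".join(cluster_columns)
    return (
        f"CREATE TABLE `{target_table}`\n"
        f"CLUSTER BY {columns}\n"
        f"AS SELECT * FROM `{source_table}`"
    )


# Assumed names: replace with your own dataset, tables, and column.
ddl = clustered_copy_ddl(
    "my_dataset.events",
    "my_dataset.events_clustered",
    ["customer_id"],
)
print(ddl)
```

You could paste the printed statement into the BigQuery console, or, assuming the `google-cloud-bigquery` package is installed and credentials are configured, run it with `bigquery.Client().query(ddl)`. Queries that filter on the clustering column can then skip blocks of data, which is the closest BigQuery equivalent to what an index does in a traditional database.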
As stated in the comments, this question is really about "how would BigQuery deal with my data if it were 100 times larger?" With traditional databases an index is the right solution, but BigQuery is different: as data size grows, BigQuery adds more servers to the mix, keeping performance almost constant.
In other words, as your data grows you should expect costs to increase linearly, with performance staying almost constant. No indexes needed. And this is one of the big reasons why people choose BigQuery for their analytical workloads.
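As a rough illustration of the linear cost claim, the sketch below assumes on-demand billing, where a query is charged by the bytes it scans. The price constant is a placeholder assumption, not a quoted rate, so check the current pricing page for your region. Free tiers, minimum charges, and capacity-based pricing are ignored.

```python
TIB = 2 ** 40  # bytes in one tebibyte

# Placeholder price per TiB scanned: an assumption for illustration only.
ASSUMED_PRICE_PER_TIB = 5.0


def on_demand_cost(bytes_scanned: int, price_per_tib: float) -> float:
    """Estimate the cost of a full scan under a simple bytes-scanned model."""
    return bytes_scanned / TIB * price_per_tib


# A full scan of 1 TiB versus the same table grown 100 times larger.
small = on_demand_cost(1 * TIB, ASSUMED_PRICE_PER_TIB)
large = on_demand_cost(100 * TIB, ASSUMED_PRICE_PER_TIB)

print(f"1 TiB scan:   {small:.2f}")
print(f"100 TiB scan: {large:.2f}")
print(f"cost ratio:   {large / small:.0f}x")
```

Under this model, 100 times more data scanned means 100 times the cost, regardless of the actual price. Partitioning and clustering help precisely because they reduce the bytes a query has to scan.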
(It all depends on your specific use case, of course. Please test these assertions and report back!)