Problem description
I'm working with a database schema that is running into scalability issues. One of the tables in the schema has grown to around 10 million rows, and I am exploring sharding and partitioning options to allow this schema to scale to much larger datasets (say, 1 billion to 100 billion rows). Our application must also be deployable onto several database products, including but not limited to Oracle, MS SQL Server, and MySQL.
This is a large problem in general, and I'd like to read up on what options are available. What resources are out there (books, whitepapers, web sites) for database sharding and partitioning strategies?
I agree with the other answers that you should look at your schema and indexes before resorting to sharding. 10 million rows is well within the capabilities of any of the major database engines.
However, if you want some resources for learning about sharding, try these:
- Scalability Best Practices: Lessons from eBay
- Randy Shoup on eBay's Architectural Principles - Video and Presentation
- High Scalability Site
- Mr. Moore gets to punt on sharding (when not to do it)
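
For orientation before diving into those resources, here is a minimal sketch of application-level hash sharding, which is one common approach and is independent of the database product because the routing happens in the application rather than in Oracle, SQL Server, or MySQL. The shard names, the `shard_for` and `database_for` helpers, and the choice of a customer id as the shard key are all hypothetical and only illustrate the idea; they are not taken from any of the linked material.

```python
import hashlib

# Hypothetical logical shard names; in a real deployment each would map
# to a connection string for a separate database instance.
SHARDS = ["db_shard_0", "db_shard_1", "db_shard_2", "db_shard_3"]


def shard_for(key: str, shard_count: int) -> int:
    """Map a shard key to a shard index using a stable hash.

    A cryptographic digest is used instead of Python's built-in hash(),
    which is randomized per process for strings and therefore would not
    route the same key to the same shard across restarts.
    """
    if shard_count <= 0:
        raise ValueError("shard_count must be positive")
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    # Interpret the first 8 bytes as an unsigned integer, then reduce it
    # to the range [0, shard_count).
    return int.from_bytes(digest[:8], "big") % shard_count


def database_for(customer_id: str) -> str:
    """Return the (hypothetical) database that holds this customer's rows."""
    return SHARDS[shard_for(customer_id, len(SHARDS))]


if __name__ == "__main__":
    for cid in ("customer-1", "customer-2", "customer-3"):
        print(cid, "->", database_for(cid))
```

One caveat with plain modulo hashing like this: changing the number of shards remaps most keys, so rows have to be moved. Consistent hashing or a lookup directory are the usual alternatives when the shard count is expected to grow.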