问题描述
我们有大约300万条记录一个庞大的数据,这将得到更新每3-6 months.We需要查询这些数据(continously,实时)得到一些information.What是选项 - 关系数据库(MySQL的),或类似Hadoop.Which一些其他的选择将是更好的?
We have a huge data of about 300 million records, which will get updated every 3-6 months.We need to query this data(continously, real time) to get some information.What are the options - a RDBMS(mysql) , or some other option like Hadoop.Which will be better?
推荐答案
300M的记录是非常之内,如果你使用正确的索引常规关系数据库和实时查询的范围应该是没有问题的。
300M records is well within the bounds of regular relational databases and live querying should be no problem if you use indexes properly.
Hadoop的听起来像矫枉过正,除非你真的需要高度分散和冗余的数据,也将使其更难找到支持,如果你遇到麻烦或优化。
Hadoop sounds like overkill unless you really need highly distributed and redundant data, and it will also make it harder to find support if you run into trouble or for optimizations.
这篇关于大数据 - 存储和查询的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!