Downloading large datasets for Hadoop


Question

I need a large dataset (more than 10GB) to run a Hadoop demo. Does anybody know where I can download one? Please let me know.

Recommended answer

I would suggest downloading the Million Song Dataset from the following website:

The best thing about the Million Song Dataset is that you can download a 1GB (about 10,000 songs), 10GB, 50GB, or roughly 300GB version to your Hadoop cluster and run whatever tests you want. I love using it and have learned a lot from working with this dataset.
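Once a version of the dataset has been downloaded locally, it still has to be copied into HDFS before Hadoop jobs can read it. The following is a minimal sketch of that step, assuming the standard `hdfs` command-line client is installed and on the PATH. The paths `/data/msd_subset` and `/user/demo/msd` are hypothetical placeholders, not taken from the original answer.

```python
import subprocess


def build_hdfs_upload_commands(local_path, hdfs_dir):
    """Return the argv lists that create hdfs_dir and copy local_path into it."""
    return [
        # Create the target directory (and any missing parents) in HDFS.
        ["hdfs", "dfs", "-mkdir", "-p", hdfs_dir],
        # Copy the local file or directory into that HDFS directory.
        ["hdfs", "dfs", "-put", local_path, hdfs_dir],
    ]


def upload_to_hdfs(local_path, hdfs_dir):
    """Run the upload commands; raises CalledProcessError if one of them fails."""
    for argv in build_hdfs_upload_commands(local_path, hdfs_dir):
        subprocess.run(argv, check=True)


# Example call (hypothetical paths, requires a working Hadoop client):
# upload_to_hdfs("/data/msd_subset", "/user/demo/msd")
```

Keeping the command construction separate from execution makes it possible to inspect what would be run before touching the cluster.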

To start, you can download the subset for any single letter from A to Z; each one ranges from 1GB to 20GB. You can also use the Infochimps site:
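Because the per-letter subsets vary so much in size, it can help to check whether what has been downloaded so far actually reaches the 10GB the question asks for. A rough sketch, assuming the files sit in a local directory; the function names and the default threshold are illustrative choices, not part of the original answer.

```python
import os


def directory_size_bytes(root):
    """Sum the sizes of all regular files under root, skipping symlinks."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.islink(path):
                total += os.path.getsize(path)
    return total


def meets_target(root, target_gb=10):
    """Return True if the files under root add up to at least target_gb GiB."""
    return directory_size_bytes(root) >= target_gb * 1024 ** 3
```

If the check fails, downloading one or more additional letters and running it again is one way to reach the desired volume.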

In one of my blog posts, I showed how to download the 1GB dataset and run Pig scripts on it:


08-24 03:13