Re: Dataset for Hive

2015-04-01 Thread Chao Sun
Hi Xiaohe, You can try TPC-DS from https://github.com/hortonworks/hive-testbench. It contains large number of queries with complex joins. Chao On Wed, Apr 1, 2015 at 9:30 PM, xiaohe lan wrote: > Hi All, > > I am new to Hive. Just set up a 5 node Hadoop environment and want to have > a try on H

Dataset for Hive

2015-04-01 Thread xiaohe lan
Hi All, I am new to Hive. Just set up a 5 node Hadoop environment and want to have a try on HiveQL. Is there any dataset I can download to play HiveQL. The dataset should have several tables some I can write some complex join. About 100G should be fine. Thanks, Xiaohe